Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 683788 |
| Missing cells | 221365 |
| Missing cells (%) | 0.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 213.9 MiB |
| Average record size in memory | 328.0 B |
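The two derived figures in this table (missing-cell percentage and average record size) can be re-derived from the raw counts. The sketch below is a plain arithmetic check, not the profiler's own code; it assumes that 1 MiB means 1024 × 1024 bytes and that the percentage is taken over all cells (variables × observations).

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 41
n_observations = 683_788
missing_cells = 221_365
total_size_mib = 213.9

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values reported above (0.8% and 328.0 B).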
Variable types
| Numeric | 14 |
|---|---|
| DateTime | 1 |
| Categorical | 11 |
| Text | 6 |
| Boolean | 9 |
state has constant value "" | Constant |
block_id is highly overall correlated with borocode and 2 other fields | High correlation |
boro_ct is highly overall correlated with borocode and 7 other fields | High correlation |
borocode is highly overall correlated with block_id and 8 other fields | High correlation |
boroname is highly overall correlated with block_id and 4 other fields | High correlation |
cb_num is highly overall correlated with boro_ct and 7 other fields | High correlation |
cncldist is highly overall correlated with boro_ct and 7 other fields | High correlation |
guards is highly overall correlated with status | High correlation |
health is highly overall correlated with status | High correlation |
latitude is highly overall correlated with boro_ct and 7 other fields | High correlation |
longitude is highly overall correlated with cncldist and 7 other fields | High correlation |
sidewalk is highly overall correlated with status | High correlation |
st_assem is highly overall correlated with longitude and 4 other fields | High correlation |
st_senate is highly overall correlated with boro_ct and 7 other fields | High correlation |
status is highly overall correlated with guards and 4 other fields | High correlation |
steward is highly overall correlated with status | High correlation |
stump_diam is highly overall correlated with status | High correlation |
x_sp is highly overall correlated with cncldist and 7 other fields | High correlation |
y_sp is highly overall correlated with boro_ct and 7 other fields | High correlation |
zip_city is highly overall correlated with block_id and 11 other fields | High correlation |
zipcode is highly overall correlated with longitude and 3 other fields | High correlation |
curb_loc is highly imbalanced (76.1%) | Imbalance |
status is highly imbalanced (80.1%) | Imbalance |
steward is highly imbalanced (51.7%) | Imbalance |
guards is highly imbalanced (65.6%) | Imbalance |
root_grate is highly imbalanced (95.3%) | Imbalance |
root_other is highly imbalanced (73.8%) | Imbalance |
trunk_wire is highly imbalanced (86.2%) | Imbalance |
trnk_light is highly imbalanced (98.4%) | Imbalance |
trnk_other is highly imbalanced (72.4%) | Imbalance |
brch_light is highly imbalanced (56.0%) | Imbalance |
brch_shoe is highly imbalanced (99.3%) | Imbalance |
brch_other is highly imbalanced (77.8%) | Imbalance |
health has 31616 (4.6%) missing values | Missing |
spc_latin has 31619 (4.6%) missing values | Missing |
spc_common has 31619 (4.6%) missing values | Missing |
steward has 31615 (4.6%) missing values | Missing |
guards has 31616 (4.6%) missing values | Missing |
sidewalk has 31616 (4.6%) missing values | Missing |
problems has 31664 (4.6%) missing values | Missing |
tree_id has unique values | Unique |
tree_dbh has 17932 (2.6%) zeros | Zeros |
stump_diam has 666134 (97.4%) zeros | Zeros |
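The Imbalance percentages in the alerts above are consistent with a score of one minus the normalised Shannon entropy of the class shares. The sketch below treats that formula as an assumption (the report itself does not state it) and checks it against the curb_loc and status counts listed in their variable sections; `imbalance_score` is a hypothetical helper name, and returning 0.0 for fewer than two classes is a choice made here.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares.

    Assumed formula: 0.0 means perfectly balanced classes, values near 1.0
    mean one class dominates. Fewer than two classes returns 0.0 by choice.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Class counts copied from the curb_loc and status sections of this report.
curb_loc = [656_896, 26_892]          # OnCurb, OffsetFromCurb
status = [652_173, 17_654, 13_961]    # Alive, Stump, Dead

print(f"curb_loc: {100 * imbalance_score(curb_loc):.1f}%")
print(f"status:   {100 * imbalance_score(status):.1f}%")
```

Under this assumption the two scores round to the 76.1% and 80.1% shown in the alerts.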
Reproduction
| Analysis started | 2023-12-13 14:23:17.568128 |
|---|---|
| Analysis finished | 2023-12-13 14:38:30.198462 |
| Duration | 15 minutes and 12.63 seconds |
| Software version | ydata-profiling v4.6.3 |
| Download configuration | config.json |
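The reported duration follows directly from the two timestamps in this table. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-12-13 14:23:17.568128")
finished = datetime.fromisoformat("2023-12-13 14:38:30.198462")

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)

print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```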
tree_id
Real number (ℝ)
UNIQUE 
| Distinct | 683788 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 365205.01 |
| Minimum | 3 |
|---|---|
| Maximum | 722694 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 38056.35 |
| Q1 | 186582.75 |
| median | 366214.5 |
| Q3 | 546170.25 |
| 95-th percentile | 687716.65 |
| Maximum | 722694 |
| Range | 722691 |
| Interquartile range (IQR) | 359587.5 |
Descriptive statistics
| Standard deviation | 208122.09 |
|---|---|
| Coefficient of variation (CV) | 0.56987743 |
| Kurtosis | -1.192863 |
| Mean | 365205.01 |
| Median Absolute Deviation (MAD) | 179812.5 |
| Skewness | -0.017161241 |
| Sum | 2.497228 × 10¹¹ |
| Variance | 4.3314806 × 10¹⁰ |
| Monotonicity | Not monotonic |
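Several rows in the tree_id tables are simple functions of others: range is maximum minus minimum, IQR is Q3 minus Q1, the coefficient of variation is standard deviation over mean, and variance is the standard deviation squared. The sketch below recomputes them from the rounded values printed in the report, so the last digits can differ slightly from what the profiler computed on the raw data.

```python
# Values copied from the tree_id tables (already rounded by the report).
minimum, maximum = 3, 722_694
q1, q3 = 186_582.75, 546_170.25
mean, std = 365_205.01, 208_122.09

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(f"range:    {value_range}")
print(f"IQR:      {iqr}")
print(f"CV:       {cv:.6f}")
print(f"variance: {variance:.4e}")
```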
| Value | Count | Frequency (%) |
| 606945 | 1 | < 0.1% |
| 179292 | 1 | < 0.1% |
| 467244 | 1 | < 0.1% |
| 21140 | 1 | < 0.1% |
| 348376 | 1 | < 0.1% |
| 266930 | 1 | < 0.1% |
| 644028 | 1 | < 0.1% |
| 86378 | 1 | < 0.1% |
| 527011 | 1 | < 0.1% |
| 529114 | 1 | < 0.1% |
| Other values (683778) | 683778 |
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 4 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 |
| Value | Count | Frequency (%) |
| 722694 | 1 | |
| 722693 | 1 | |
| 722692 | 1 | |
| 722691 | 1 | |
| 722690 | 1 | |
| 722689 | 1 | |
| 722688 | 1 | |
| 722687 | 1 | |
| 722686 | 1 | |
| 722685 | 1 |
block_id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 101390 |
|---|---|
| Distinct (%) | 14.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 313793.1 |
| Minimum | 100002 |
|---|---|
| Maximum | 999999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 100002 |
|---|---|
| 5-th percentile | 107430 |
| Q1 | 221556 |
| median | 319967 |
| Q3 | 404624 |
| 95-th percentile | 510007 |
| Maximum | 999999 |
| Range | 899997 |
| Interquartile range (IQR) | 183068 |
Descriptive statistics
| Standard deviation | 114839.02 |
|---|---|
| Coefficient of variation (CV) | 0.36597053 |
| Kurtosis | -0.51238308 |
| Mean | 313793.1 |
| Median Absolute Deviation (MAD) | 91359 |
| Skewness | 0.081632776 |
| Sum | 2.1456795 × 10¹¹ |
| Variance | 1.3188002 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 204850 | 450 | 0.1% |
| 602362 | 358 | 0.1% |
| 208115 | 250 | < 0.1% |
| 506756 | 206 | < 0.1% |
| 233208 | 197 | < 0.1% |
| 340498 | 195 | < 0.1% |
| 111902 | 178 | < 0.1% |
| 302421 | 159 | < 0.1% |
| 501930 | 145 | < 0.1% |
| 340497 | 135 | < 0.1% |
| Other values (101380) | 681515 |
| Value | Count | Frequency (%) |
| 100002 | 4 | < 0.1% |
| 100003 | 14 | |
| 100004 | 3 | < 0.1% |
| 100005 | 4 | < 0.1% |
| 100014 | 2 | < 0.1% |
| 100015 | 5 | < 0.1% |
| 100016 | 5 | < 0.1% |
| 100018 | 5 | < 0.1% |
| 100019 | 2 | < 0.1% |
| 100028 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 999999 | 5 | < 0.1% |
| 603084 | 1 | < 0.1% |
| 603083 | 3 | < 0.1% |
| 603082 | 6 | < 0.1% |
| 603081 | 6 | < 0.1% |
| 603077 | 9 | < 0.1% |
| 603075 | 4 | < 0.1% |
| 603074 | 36 | |
| 603073 | 34 | |
| 603072 | 35 |
created_at
Date
| Distinct | 483 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| Minimum | 2015-05-19 00:00:00 |
|---|---|
| Maximum | 2016-10-05 00:00:00 |
tree_dbh
Real number (ℝ)
ZEROS 
| Distinct | 146 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.279787 |
| Minimum | 0 |
|---|---|
| Maximum | 450 |
| Zeros | 17932 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 9 |
| Q3 | 16 |
| 95-th percentile | 28 |
| Maximum | 450 |
| Range | 450 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 8.7230423 |
|---|---|
| Coefficient of variation (CV) | 0.77333395 |
| Kurtosis | 46.977599 |
| Mean | 11.279787 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.4294724 |
| Sum | 7712983 |
| Variance | 76.091466 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 60372 | 8.8% |
| 3 | 54454 | 8.0% |
| 2 | 41977 | 6.1% |
| 5 | 41642 | 6.1% |
| 11 | 37978 | 5.6% |
| 6 | 36519 | 5.3% |
| 7 | 30862 | 4.5% |
| 8 | 30828 | 4.5% |
| 10 | 29672 | 4.3% |
| 9 | 28903 | 4.2% |
| Other values (136) | 290581 |
| Value | Count | Frequency (%) |
| 0 | 17932 | 2.6% |
| 1 | 2899 | 0.4% |
| 2 | 41977 | |
| 3 | 54454 | |
| 4 | 60372 | |
| 5 | 41642 | |
| 6 | 36519 | |
| 7 | 30862 | |
| 8 | 30828 | |
| 9 | 28903 |
| Value | Count | Frequency (%) |
| 450 | 1 | |
| 425 | 1 | |
| 389 | 1 | |
| 318 | 2 | |
| 298 | 1 | |
| 293 | 1 | |
| 291 | 1 | |
| 282 | 1 | |
| 281 | 1 | |
| 266 | 1 |
stump_diam
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 100 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.43246299 |
| Minimum | 0 |
|---|---|
| Maximum | 140 |
| Zeros | 666134 |
| Zeros (%) | 97.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 140 |
| Range | 140 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.2902407 |
|---|---|
| Coefficient of variation (CV) | 7.6081442 |
| Kurtosis | 145.29814 |
| Mean | 0.43246299 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.36345 |
| Sum | 295713 |
| Variance | 10.825684 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 666134 | |
| 4 | 966 | 0.1% |
| 5 | 939 | 0.1% |
| 3 | 779 | 0.1% |
| 6 | 754 | 0.1% |
| 12 | 717 | 0.1% |
| 10 | 716 | 0.1% |
| 8 | 660 | 0.1% |
| 14 | 660 | 0.1% |
| 15 | 648 | 0.1% |
| Other values (90) | 10815 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 666134 | |
| 1 | 106 | < 0.1% |
| 2 | 363 | 0.1% |
| 3 | 779 | 0.1% |
| 4 | 966 | 0.1% |
| 5 | 939 | 0.1% |
| 6 | 754 | 0.1% |
| 7 | 612 | 0.1% |
| 8 | 660 | 0.1% |
| 9 | 530 | 0.1% |
| Value | Count | Frequency (%) |
| 140 | 1 | |
| 134 | 1 | |
| 131 | 1 | |
| 125 | 1 | |
| 120 | 1 | |
| 118 | 1 | |
| 115 | 1 | |
| 109 | 1 | |
| 107 | 1 | |
| 104 | 1 |
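The High correlation alert between stump_diam and status lines up with the totals elsewhere in this report: the number of zero stump_diam values equals the combined Alive and Dead counts, and the steward missing count equals the combined Stump and Dead counts. The sketch below only compares those totals; matching totals suggest, but do not prove, that the same rows are involved.

```python
# Counts copied from the status, stump_diam and steward sections.
n_rows = 683_788
status_counts = {"Alive": 652_173, "Stump": 17_654, "Dead": 13_961}
stump_diam_zeros = 666_134
steward_missing = 31_615

# Trees that are not stumps would plausibly have stump_diam == 0.
non_stumps = status_counts["Alive"] + status_counts["Dead"]

# Trees that are not alive would plausibly lack stewardship observations.
not_alive = status_counts["Stump"] + status_counts["Dead"]

print(sum(status_counts.values()) == n_rows)   # status has no missing values
print(non_stumps == stump_diam_zeros)
print(not_alive == steward_missing)
```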
curb_loc
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| OnCurb | 656896 |
|---|---|
| OffsetFromCurb | 26892 |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.3146238 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4317864 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
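The length and character figures for curb_loc can be reproduced from its two value counts: total characters is the sum of value length times count, and mean length is that total divided by the number of values. A small check under that reading:

```python
# Value counts copied from the curb_loc "Common Values" table.
value_counts = {"OnCurb": 656_896, "OffsetFromCurb": 26_892}

n_values = sum(value_counts.values())
total_characters = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_characters / n_values

# Distinct characters across all distinct values (case-sensitive).
distinct_characters = len(set("".join(value_counts)))

print(total_characters)
print(f"{mean_length:.7f}")
print(distinct_characters)
```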
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OnCurb |
|---|---|
| 2nd row | OnCurb |
| 3rd row | OnCurb |
| 4th row | OnCurb |
| 5th row | OnCurb |
Common Values
| Value | Count | Frequency (%) |
| OnCurb | 656896 | 96.1% |
| OffsetFromCurb | 26892 | 3.9% |
Words
| Value | Count | Frequency (%) |
| oncurb | 656896 | |
| offsetfromcurb | 26892 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 710680 | |
| O | 683788 | |
| C | 683788 | |
| u | 683788 | |
| b | 683788 | |
| n | 656896 | |
| f | 53784 | 1.2% |
| s | 26892 | 0.6% |
| e | 26892 | 0.6% |
| t | 26892 | 0.6% |
| Other values (3) | 80676 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2923396 | |
| Uppercase Letter | 1394468 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 710680 | |
| u | 683788 | |
| b | 683788 | |
| n | 656896 | |
| f | 53784 | 1.8% |
| s | 26892 | 0.9% |
| e | 26892 | 0.9% |
| t | 26892 | 0.9% |
| o | 26892 | 0.9% |
| m | 26892 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 683788 | |
| C | 683788 | |
| F | 26892 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4317864 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 710680 | |
| O | 683788 | |
| C | 683788 | |
| u | 683788 | |
| b | 683788 | |
| n | 656896 | |
| f | 53784 | 1.2% |
| s | 26892 | 0.6% |
| e | 26892 | 0.6% |
| t | 26892 | 0.6% |
| Other values (3) | 80676 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4317864 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 710680 | |
| O | 683788 | |
| C | 683788 | |
| u | 683788 | |
| b | 683788 | |
| n | 656896 | |
| f | 53784 | 1.2% |
| s | 26892 | 0.6% |
| e | 26892 | 0.6% |
| t | 26892 | 0.6% |
| Other values (3) | 80676 | 1.9% |
status
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| Alive | 652173 |
|---|---|
| Stump | 17654 |
| Dead | 13961 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.9795829 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3404979 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alive |
|---|---|
| 2nd row | Alive |
| 3rd row | Alive |
| 4th row | Alive |
| 5th row | Alive |
Common Values
| Value | Count | Frequency (%) |
| Alive | 652173 | 95.4% |
| Stump | 17654 | 2.6% |
| Dead | 13961 | 2.0% |
Words
| Value | Count | Frequency (%) |
| alive | 652173 | |
| stump | 17654 | 2.6% |
| dead | 13961 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 666134 | |
| A | 652173 | |
| l | 652173 | |
| i | 652173 | |
| v | 652173 | |
| S | 17654 | 0.5% |
| t | 17654 | 0.5% |
| u | 17654 | 0.5% |
| m | 17654 | 0.5% |
| p | 17654 | 0.5% |
| Other values (3) | 41883 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2721191 | |
| Uppercase Letter | 683788 | 20.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 666134 | |
| l | 652173 | |
| i | 652173 | |
| v | 652173 | |
| t | 17654 | 0.6% |
| u | 17654 | 0.6% |
| m | 17654 | 0.6% |
| p | 17654 | 0.6% |
| a | 13961 | 0.5% |
| d | 13961 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 652173 | |
| S | 17654 | 2.6% |
| D | 13961 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3404979 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 666134 | |
| A | 652173 | |
| l | 652173 | |
| i | 652173 | |
| v | 652173 | |
| S | 17654 | 0.5% |
| t | 17654 | 0.5% |
| u | 17654 | 0.5% |
| m | 17654 | 0.5% |
| p | 17654 | 0.5% |
| Other values (3) | 41883 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3404979 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 666134 | |
| A | 652173 | |
| l | 652173 | |
| i | 652173 | |
| v | 652173 | |
| S | 17654 | 0.5% |
| t | 17654 | 0.5% |
| u | 17654 | 0.5% |
| m | 17654 | 0.5% |
| p | 17654 | 0.5% |
| Other values (3) | 41883 | 1.2% |
health
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31616 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
| Good | 528850 |
|---|---|
| Fair | 96504 |
| Poor | 26818 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2608688 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Good |
|---|---|
| 2nd row | Good |
| 3rd row | Good |
| 4th row | Good |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Good | 528850 | 77.3% |
| Fair | 96504 | 14.1% |
| Poor | 26818 | 3.9% |
| (Missing) | 31616 | 4.6% |
Words
| Value | Count | Frequency (%) |
| good | 528850 | |
| fair | 96504 | 14.8% |
| poor | 26818 | 4.1% |
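The two health tables above report different percentages for the same counts (Fair is 14.1% in one and 14.8% in the other). The figures are consistent with the first table dividing by all rows and the second dividing by non-missing rows only; the sketch below checks that reading, which is an inference from the numbers rather than something the report states.

```python
# Counts copied from the health section.
n_rows = 683_788
missing = 31_616
counts = {"Fair": 96_504, "Poor": 26_818}

present = n_rows - missing

for label, count in counts.items():
    share_of_all = 100 * count / n_rows       # denominator includes missing rows
    share_of_present = 100 * count / present  # denominator excludes missing rows
    print(f"{label}: {share_of_all:.1f}% of all rows, "
          f"{share_of_present:.1f}% of non-missing rows")
```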
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1111336 | |
| G | 528850 | |
| d | 528850 | |
| r | 123322 | 4.7% |
| F | 96504 | 3.7% |
| a | 96504 | 3.7% |
| i | 96504 | 3.7% |
| P | 26818 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1956516 | |
| Uppercase Letter | 652172 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1111336 | |
| d | 528850 | |
| r | 123322 | 6.3% |
| a | 96504 | 4.9% |
| i | 96504 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 528850 | |
| F | 96504 | 14.8% |
| P | 26818 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2608688 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1111336 | |
| G | 528850 | |
| d | 528850 | |
| r | 123322 | 4.7% |
| F | 96504 | 3.7% |
| a | 96504 | 3.7% |
| i | 96504 | 3.7% |
| P | 26818 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2608688 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1111336 | |
| G | 528850 | |
| d | 528850 | |
| r | 123322 | 4.7% |
| F | 96504 | 3.7% |
| a | 96504 | 3.7% |
| i | 96504 | 3.7% |
| P | 26818 | 1.0% |
spc_latin
Text
MISSING 
| Distinct | 132 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31619 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 28 |
| Mean length | 18.051819 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11772837 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fraxinus pennsylvanica |
|---|---|
| 2nd row | Gleditsia triacanthos var. inermis |
| 3rd row | Pyrus calleryana |
| 4th row | Pyrus calleryana |
| 5th row | Prunus virginiana |
| Value | Count | Frequency (%) |
| acer | 88739 | 6.0% |
| x | 87130 | 5.9% |
| platanus | 87014 | 5.9% |
| acerifolia | 87014 | 5.9% |
| quercus | 82867 | 5.6% |
| var | 64605 | 4.4% |
| inermis | 64605 | 4.4% |
| gleditsia | 64264 | 4.3% |
| triacanthos | 64264 | 4.3% |
| pyrus | 58931 | 4.0% |
| Other values (168) | 732098 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1560128 | |
| i | 1049571 | 8.9% |
| r | 993219 | 8.4% |
| s | 835821 | 7.1% |
| (space) | 829362 | 7.0% |
| l | 692257 | 5.9% |
| e | 689988 | 5.9% |
| u | 685213 | 5.8% |
| n | 619786 | 5.3% |
| c | 584030 | 5.0% |
| Other values (40) | 3233462 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10203009 | |
| Space Separator | 829362 | 7.0% |
| Uppercase Letter | 664015 | 5.6% |
| Other Punctuation | 76451 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1560128 | |
| i | 1049571 | |
| r | 993219 | |
| s | 835821 | |
| l | 692257 | 6.8% |
| e | 689988 | 6.8% |
| u | 685213 | 6.7% |
| n | 619786 | 6.1% |
| c | 584030 | 5.7% |
| t | 539091 | 5.3% |
| Other values (16) | 1953905 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 190226 | |
| A | 93024 | |
| G | 88652 | |
| Q | 82867 | |
| T | 53125 | 8.0% |
| Z | 29258 | 4.4% |
| C | 26340 | 4.0% |
| S | 25213 | 3.8% |
| F | 20379 | 3.1% |
| U | 14915 | 2.2% |
| Other values (11) | 40016 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 64605 | |
| ' | 11846 | 15.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 829362 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10867024 | |
| Common | 905813 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1560128 | |
| i | 1049571 | |
| r | 993219 | 9.1% |
| s | 835821 | 7.7% |
| l | 692257 | 6.4% |
| e | 689988 | 6.3% |
| u | 685213 | 6.3% |
| n | 619786 | 5.7% |
| c | 584030 | 5.4% |
| t | 539091 | 5.0% |
| Other values (37) | 2617920 |
Common
| Value | Count | Frequency (%) |
| (space) | 829362 | |
| . | 64605 | 7.1% |
| ' | 11846 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11772837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1560128 | |
| i | 1049571 | 8.9% |
| r | 993219 | 8.4% |
| s | 835821 | 7.1% |
| (space) | 829362 | 7.0% |
| l | 692257 | 5.9% |
| e | 689988 | 5.9% |
| u | 685213 | 5.8% |
| n | 619786 | 5.3% |
| c | 584030 | 5.0% |
| Other values (40) | 3233462 |
spc_common
Text
MISSING 
| Distinct | 132 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31619 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 20 |
| Mean length | 11.968832 |
| Min length | 3 |
Characters and Unicode
| Total characters | 7805701 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | green ash |
|---|---|
| 2nd row | honeylocust |
| 3rd row | Callery pear |
| 4th row | Callery pear |
| 5th row | 'Schubert' chokecherry |
| Value | Count | Frequency (%) |
| maple | 88675 | 7.6% |
| london | 87014 | 7.4% |
| planetree | 87014 | 7.4% |
| oak | 82867 | 7.1% |
| honeylocust | 64264 | 5.5% |
| callery | 58931 | 5.0% |
| pear | 58931 | 5.0% |
| pin | 53185 | 4.6% |
| linden | 51267 | 4.4% |
| japanese | 35774 | 3.1% |
| Other values (135) | 500206 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1090341 | |
| a | 697628 | 8.9% |
| n | 661863 | 8.5% |
| l | 632093 | 8.1% |
| o | 590406 | 7.6% |
| r | 548056 | 7.0% |
| (space) | 515959 | 6.6% |
| p | 390862 | 5.0% |
| t | 289664 | 3.7% |
| i | 261377 | 3.3% |
| Other values (32) | 2127452 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6981119 | |
| Space Separator | 515959 | 6.6% |
| Uppercase Letter | 289070 | 3.7% |
| Other Punctuation | 11263 | 0.1% |
| Dash Punctuation | 8290 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1090341 | |
| a | 697628 | |
| n | 661863 | |
| l | 632093 | 9.1% |
| o | 590406 | 8.5% |
| r | 548056 | 7.9% |
| p | 390862 | 5.6% |
| t | 289664 | 4.1% |
| i | 261377 | 3.7% |
| y | 209494 | 3.0% |
| Other values (15) | 1609335 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 87014 | |
| C | 66211 | |
| J | 35774 | |
| N | 34544 | 12.0% |
| A | 29293 | 10.1% |
| S | 27392 | 9.5% |
| E | 3915 | 1.4% |
| K | 3843 | 1.3% |
| O | 323 | 0.1% |
| T | 317 | 0.1% |
| Other values (4) | 444 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 515959 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 11263 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7270189 | |
| Common | 535512 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1090341 | |
| a | 697628 | 9.6% |
| n | 661863 | 9.1% |
| l | 632093 | 8.7% |
| o | 590406 | 8.1% |
| r | 548056 | 7.5% |
| p | 390862 | 5.4% |
| t | 289664 | 4.0% |
| i | 261377 | 3.6% |
| y | 209494 | 2.9% |
| Other values (29) | 1898405 |
Common
| Value | Count | Frequency (%) |
| (space) | 515959 | |
| ' | 11263 | 2.1% |
| - | 8290 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7805701 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1090341 | |
| a | 697628 | 8.9% |
| n | 661863 | 8.5% |
| l | 632093 | 8.1% |
| o | 590406 | 7.6% |
| r | 548056 | 7.0% |
| (space) | 515959 | 6.6% |
| p | 390862 | 5.0% |
| t | 289664 | 3.7% |
| i | 261377 | 3.3% |
| Other values (32) | 2127452 |
steward
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31615 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
| None | 487823 |
|---|---|
| 1or2 | 143557 |
| 3or4 | 19183 |
| 4orMore | 1610 |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.007406 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2613522 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 487823 | 71.3% |
| 1or2 | 143557 | 21.0% |
| 3or4 | 19183 | 2.8% |
| 4orMore | 1610 | 0.2% |
| (Missing) | 31615 | 4.6% |
Words
| Value | Count | Frequency (%) |
| none | 487823 | |
| 1or2 | 143557 | 22.0% |
| 3or4 | 19183 | 2.9% |
| 4ormore | 1610 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 653783 | |
| e | 489433 | |
| N | 487823 | |
| n | 487823 | |
| r | 165960 | 6.4% |
| 1 | 143557 | 5.5% |
| 2 | 143557 | 5.5% |
| 4 | 20793 | 0.8% |
| 3 | 19183 | 0.7% |
| M | 1610 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1796999 | |
| Uppercase Letter | 489433 | 18.7% |
| Decimal Number | 327090 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 653783 | |
| e | 489433 | |
| n | 487823 | |
| r | 165960 | 9.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 143557 | |
| 2 | 143557 | |
| 4 | 20793 | 6.4% |
| 3 | 19183 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 487823 | |
| M | 1610 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2286432 | |
| Common | 327090 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 653783 | |
| e | 489433 | |
| N | 487823 | |
| n | 487823 | |
| r | 165960 | 7.3% |
| M | 1610 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 | 143557 | |
| 2 | 143557 | |
| 4 | 20793 | 6.4% |
| 3 | 19183 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2613522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 653783 | |
| e | 489433 | |
| N | 487823 | |
| n | 487823 | |
| r | 165960 | 6.4% |
| 1 | 143557 | 5.5% |
| 2 | 143557 | 5.5% |
| 4 | 20793 | 0.8% |
| 3 | 19183 | 0.7% |
| M | 1610 | 0.1% |
guards
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31616 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
| None | 572306 |
|---|---|
| Helpful | 51866 |
| Harmful | 20252 |
| Unsure | 7748 |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.3555044 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2840538 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 572306 | 83.7% |
| Helpful | 51866 | 7.6% |
| Harmful | 20252 | 3.0% |
| Unsure | 7748 | 1.1% |
| (Missing) | 31616 | 4.6% |
Words
| Value | Count | Frequency (%) |
| none | 572306 | |
| helpful | 51866 | 8.0% |
| harmful | 20252 | 3.1% |
| unsure | 7748 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 631920 | |
| n | 580054 | |
| N | 572306 | |
| o | 572306 | |
| l | 123984 | 4.4% |
| u | 79866 | 2.8% |
| H | 72118 | 2.5% |
| f | 72118 | 2.5% |
| p | 51866 | 1.8% |
| r | 28000 | 1.0% |
| Other values (4) | 56000 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2188366 | |
| Uppercase Letter | 652172 | 23.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 631920 | |
| n | 580054 | |
| o | 572306 | |
| l | 123984 | 5.7% |
| u | 79866 | 3.6% |
| f | 72118 | 3.3% |
| p | 51866 | 2.4% |
| r | 28000 | 1.3% |
| a | 20252 | 0.9% |
| m | 20252 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 572306 | |
| H | 72118 | 11.1% |
| U | 7748 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2840538 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 631920 | |
| n | 580054 | |
| N | 572306 | |
| o | 572306 | |
| l | 123984 | 4.4% |
| u | 79866 | 2.8% |
| H | 72118 | 2.5% |
| f | 72118 | 2.5% |
| p | 51866 | 1.8% |
| r | 28000 | 1.0% |
| Other values (4) | 56000 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2840538 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 631920 | |
| n | 580054 | |
| N | 572306 | |
| o | 572306 | |
| l | 123984 | 4.4% |
| u | 79866 | 2.8% |
| H | 72118 | 2.5% |
| f | 72118 | 2.5% |
| p | 51866 | 1.8% |
| r | 28000 | 1.0% |
| Other values (4) | 56000 | 2.0% |
sidewalk
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31616 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
| NoDamage | 464978 |
|---|---|
| Damage | 187194 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.4259367 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4842988 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NoDamage |
|---|---|
| 2nd row | NoDamage |
| 3rd row | NoDamage |
| 4th row | NoDamage |
| 5th row | NoDamage |
Common Values
| Value | Count | Frequency (%) |
| NoDamage | 464978 | 68.0% |
| Damage | 187194 | 27.4% |
| (Missing) | 31616 | 4.6% |
Words
| Value | Count | Frequency (%) |
| nodamage | 464978 | |
| damage | 187194 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1304344 | |
| D | 652172 | |
| m | 652172 | |
| g | 652172 | |
| e | 652172 | |
| N | 464978 | 9.6% |
| o | 464978 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3725838 | |
| Uppercase Letter | 1117150 | 23.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1304344 | |
| m | 652172 | |
| g | 652172 | |
| e | 652172 | |
| o | 464978 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 652172 | |
| N | 464978 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4842988 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1304344 | |
| D | 652172 | |
| m | 652172 | |
| g | 652172 | |
| e | 652172 | |
| N | 464978 | 9.6% |
| o | 464978 | 9.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4842988 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1304344 | |
| D | 652172 | |
| m | 652172 | |
| g | 652172 | |
| e | 652172 | |
| N | 464978 | 9.6% |
| o | 464978 | 9.6% |
user_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| TreesCount Staff | 296284 |
|---|---|
| Volunteer | 217518 |
| NYC Parks Staff | 169986 |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 13.524654 |
| Min length | 9 |
Characters and Unicode
| Total characters | 9247996 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TreesCount Staff |
|---|---|
| 2nd row | Volunteer |
| 3rd row | TreesCount Staff |
| 4th row | TreesCount Staff |
| 5th row | TreesCount Staff |
Common Values
| Value | Count | Frequency (%) |
| TreesCount Staff | 296284 | 43.3% |
| Volunteer | 217518 | 31.8% |
| NYC Parks Staff | 169986 | 24.9% |
Words
| Value | Count | Frequency (%) |
| staff | 466270 | |
| treescount | 296284 | |
| volunteer | 217518 | |
| nyc | 169986 | 12.9% |
| parks | 169986 | 12.9% |
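The table above counts individual words rather than whole category values, which is why "staff" (466270) exceeds any single user_type count. The sketch below reproduces the figures by lowercasing each value, splitting it on whitespace and weighting each word by the value's count; that tokenisation is an assumption that happens to match the table, not a description of the profiler's internals.

```python
from collections import Counter

# Value counts copied from the user_type "Common Values" table.
value_counts = {
    "TreesCount Staff": 296_284,
    "Volunteer": 217_518,
    "NYC Parks Staff": 169_986,
}

# Assumed tokenisation: lowercase, split on whitespace.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())

for word, count in word_counts.most_common():
    print(f"{word}: {count} ({100 * count / total_words:.1f}%)")
```

Under this assumption "nyc" and "parks" each come out at 12.9% of all words, as reported.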
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1027604 | 11.1% |
| t | 980072 | 10.6% |
| f | 932540 | 10.1% |
| r | 683788 | 7.4% |
| (space) | 636256 | 6.9% |
| a | 636256 | 6.9% |
| o | 513802 | 5.6% |
| u | 513802 | 5.6% |
| n | 513802 | 5.6% |
| s | 466270 | 5.0% |
| Other values (9) | 2343804 | 25.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6655440 | 72.0% |
| Uppercase Letter | 1956300 | 21.2% |
| Space Separator | 636256 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1027604 | 15.4% |
| t | 980072 | 14.7% |
| f | 932540 | 14.0% |
| r | 683788 | 10.3% |
| a | 636256 | 9.6% |
| o | 513802 | 7.7% |
| u | 513802 | 7.7% |
| n | 513802 | 7.7% |
| s | 466270 | 7.0% |
| l | 217518 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 466270 | 23.8% |
| S | 466270 | 23.8% |
| T | 296284 | 15.1% |
| V | 217518 | 11.1% |
| N | 169986 | 8.7% |
| Y | 169986 | 8.7% |
| P | 169986 | 8.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 636256 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8611740 | 93.1% |
| Common | 636256 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1027604 | 11.9% |
| t | 980072 | 11.4% |
| f | 932540 | 10.8% |
| r | 683788 | 7.9% |
| a | 636256 | 7.4% |
| o | 513802 | 6.0% |
| u | 513802 | 6.0% |
| n | 513802 | 6.0% |
| s | 466270 | 5.4% |
| C | 466270 | 5.4% |
| Other values (8) | 1877534 | 21.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 636256 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9247996 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1027604 | 11.1% |
| t | 980072 | 10.6% |
| f | 932540 | 10.1% |
| r | 683788 | 7.4% |
| (space) | 636256 | 6.9% |
| a | 636256 | 6.9% |
| o | 513802 | 5.6% |
| u | 513802 | 5.6% |
| n | 513802 | 5.6% |
| s | 466270 | 5.0% |
| Other values (9) | 2343804 | 25.3% |
problems
Text
MISSING 
| Distinct | 232 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31664 |
| Missing (%) | 4.6% |
| Memory size | 5.2 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 4 |
| Mean length | 6.6444695 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4333018 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 45 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Stones |
|---|---|
| 2nd row | BranchLights |
| 3rd row | BranchLights |
| 4th row | None |
| 5th row | BranchLights |
| Value | Count | Frequency (%) |
| none | 426280 | 65.4% |
| stones | 95673 | 14.7% |
| branchlights | 29452 | 4.5% |
| stonesbranchlights | 17808 | 2.7% |
| rootother | 11418 | 1.8% |
| trunkother | 11143 | 1.7% |
| branchother | 8352 | 1.3% |
| stonestrunkother | 5183 | 0.8% |
| stonesrootother | 4468 | 0.7% |
| wiresrope | 4095 | 0.6% |
| Other values (222) | 38252 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 687971 | 15.9% |
| n | 687014 | 15.9% |
| o | 640197 | 14.8% |
| N | 426280 | 9.8% |
| t | 328039 | 7.6% |
| h | 237366 | 5.5% |
| r | 224795 | 5.2% |
| s | 220616 | 5.1% |
| S | 140410 | 3.2% |
| a | 94203 | 2.2% |
| Other values (15) | 646127 | 14.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3431416 | 79.2% |
| Uppercase Letter | 901602 | 20.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 687971 | 20.0% |
| n | 687014 | 20.0% |
| o | 640197 | 18.7% |
| t | 328039 | 9.6% |
| h | 237366 | 6.9% |
| r | 224795 | 6.6% |
| s | 220616 | 6.4% |
| a | 94203 | 2.7% |
| c | 86720 | 2.5% |
| i | 76670 | 2.2% |
| Other values (5) | 147825 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 426280 | 47.3% |
| S | 140410 | 15.6% |
| O | 87250 | 9.7% |
| B | 86720 | 9.6% |
| L | 63396 | 7.0% |
| R | 43596 | 4.8% |
| T | 33604 | 3.7% |
| W | 13274 | 1.5% |
| M | 3536 | 0.4% |
| G | 3536 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4333018 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 687971 | 15.9% |
| n | 687014 | 15.9% |
| o | 640197 | 14.8% |
| N | 426280 | 9.8% |
| t | 328039 | 7.6% |
| h | 237366 | 5.5% |
| r | 224795 | 5.2% |
| s | 220616 | 5.1% |
| S | 140410 | 3.2% |
| a | 94203 | 2.2% |
| Other values (15) | 646127 | 14.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4333018 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 687971 | 15.9% |
| n | 687014 | 15.9% |
| o | 640197 | 14.8% |
| N | 426280 | 9.8% |
| t | 328039 | 7.6% |
| h | 237366 | 5.5% |
| r | 224795 | 5.2% |
| s | 220616 | 5.1% |
| S | 140410 | 3.2% |
| a | 94203 | 2.2% |
| Other values (15) | 646127 | 14.9% |
root_stone
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 543789 |
|---|---|
| True | 139999 |
| Value | Count | Frequency (%) |
| False | 543789 | 79.5% |
| True | 139999 | 20.5% |
root_grate
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 680252 |
|---|---|
| True | 3536 |
| Value | Count | Frequency (%) |
| False | 680252 | 99.5% |
| True | 3536 | 0.5% |
root_other
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 653466 |
|---|---|
| True | 30322 |
| Value | Count | Frequency (%) |
| False | 653466 | 95.6% |
| True | 30322 | 4.4% |
trunk_wire
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 670514 |
|---|---|
| True | 13274 |
| Value | Count | Frequency (%) |
| False | 670514 | 98.1% |
| True | 13274 | 1.9% |
trnk_light
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 682757 |
|---|---|
| True | 1031 |
| Value | Count | Frequency (%) |
| False | 682757 | 99.8% |
| True | 1031 | 0.2% |
trnk_other
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 651215 |
|---|---|
| True | 32573 |
| Value | Count | Frequency (%) |
| False | 651215 | 95.2% |
| True | 32573 | 4.8% |
brch_light
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 621423 |
|---|---|
| True | 62365 |
| Value | Count | Frequency (%) |
| False | 621423 | 90.9% |
| True | 62365 | 9.1% |
brch_shoe
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 683377 |
|---|---|
| True | 411 |
| Value | Count | Frequency (%) |
| False | 683377 | 99.9% |
| True | 411 | 0.1% |
brch_other
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 667.9 KiB |
| False | 659433 |
|---|---|
| True | 24355 |
| Value | Count | Frequency (%) |
| False | 659433 | 96.4% |
| True | 24355 | 3.6% |
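The uppercase-letter counts reported for problems appear to line up with the True counts of the Boolean flag columns, which suggests that problems is a concatenation of capitalised labels such as BranchLights. A minimal consistency check under that assumption; the label spellings TrunkLights, TrunkOther, RootOther and BranchOther are inferred or hypothetical, and only the counts come from the report:

```python
# True counts copied from the Boolean flag tables above.
flags = {
    "root_other": 30322,
    "trunk_wire": 13274,
    "trnk_light": 1031,
    "trnk_other": 32573,
    "brch_light": 62365,
    "brch_other": 24355,
}

# Uppercase letter counts copied from the problems "Uppercase Letter" table.
upper = {"B": 86720, "L": 63396, "O": 87250, "T": 33604, "W": 13274}

# Assumed labels (not confirmed by the report): BranchLights, BranchOther,
# TrunkLights, TrunkOther, RootOther, WiresRope. Each capital letter is
# counted once per label that contains it.
expected = {
    "B": flags["brch_light"] + flags["brch_other"],
    "L": flags["brch_light"] + flags["trnk_light"],
    "O": flags["root_other"] + flags["trnk_other"] + flags["brch_other"],
    "T": flags["trnk_light"] + flags["trnk_other"],
    "W": flags["trunk_wire"],
}

# Letters whose reported count differs from the flag-based expectation.
mismatches = {k: (upper[k], expected[k]) for k in upper if upper[k] != expected[k]}
print(mismatches)
```

An empty result is consistent with the flag columns being derived from problems (or the reverse), but it does not prove which column is the source.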
address
Text
| Distinct | 408701 |
|---|---|
| Distinct (%) | 59.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 38 |
| Mean length | 18.023022 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12323926 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 275576 |
|---|---|
| Unique (%) | 40.3% |
Sample
| 1st row | 76-046 164 STREET |
|---|---|
| 2nd row | 72-020 32 AVENUE |
| 3rd row | 153-026 119 AVENUE |
| 4th row | 89 89 STREET |
| 5th row | 559 BEACH 68 STREET |
| Value | Count | Frequency (%) |
| street | 294164 | 13.4% |
| avenue | 256523 | 11.7% |
| east | 56810 | 2.6% |
| road | 32465 | 1.5% |
| west | 28418 | 1.3% |
| boulevard | 26564 | 1.2% |
| place | 24797 | 1.1% |
| parkway | 12442 | 0.6% |
| drive | 10092 | 0.5% |
| beach | 7148 | 0.3% |
| Other values (29903) | 1438138 | 65.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1564995 | 12.7% |
| (space) | 1503773 | 12.2% |
| T | 847317 | 6.9% |
| A | 680093 | 5.5% |
| R | 632291 | 5.1% |
| 1 | 625603 | 5.1% |
| 0 | 548667 | 4.5% |
| S | 538560 | 4.4% |
| N | 506195 | 4.1% |
| 2 | 422115 | 3.4% |
| Other values (29) | 4454317 | 36.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7205359 | 58.5% |
| Decimal Number | 3375683 | 27.4% |
| Space Separator | 1503773 | 12.2% |
| Dash Punctuation | 238924 | 1.9% |
| Other Punctuation | 187 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1564995 | 21.7% |
| T | 847317 | 11.8% |
| A | 680093 | 9.4% |
| R | 632291 | 8.8% |
| S | 538560 | 7.5% |
| N | 506195 | 7.0% |
| U | 352144 | 4.9% |
| V | 324424 | 4.5% |
| O | 290207 | 4.0% |
| L | 239144 | 3.3% |
| Other values (16) | 1229989 | 17.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 625603 | 18.5% |
| 0 | 548667 | 16.3% |
| 2 | 422115 | 12.5% |
| 3 | 307178 | 9.1% |
| 4 | 297043 | 8.8% |
| 5 | 287307 | 8.5% |
| 6 | 245067 | 7.3% |
| 7 | 225537 | 6.7% |
| 8 | 221016 | 6.5% |
| 9 | 196150 | 5.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1503773 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 238924 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 187 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7205359 | 58.5% |
| Common | 5118567 | 41.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1564995 | 21.7% |
| T | 847317 | 11.8% |
| A | 680093 | 9.4% |
| R | 632291 | 8.8% |
| S | 538560 | 7.5% |
| N | 506195 | 7.0% |
| U | 352144 | 4.9% |
| V | 324424 | 4.5% |
| O | 290207 | 4.0% |
| L | 239144 | 3.3% |
| Other values (16) | 1229989 | 17.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1503773 | 29.4% |
| 1 | 625603 | 12.2% |
| 0 | 548667 | 10.7% |
| 2 | 422115 | 8.2% |
| 3 | 307178 | 6.0% |
| 4 | 297043 | 5.8% |
| 5 | 287307 | 5.6% |
| 6 | 245067 | 4.8% |
| - | 238924 | 4.7% |
| 7 | 225537 | 4.4% |
| Other values (3) | 417353 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12323926 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1564995 | 12.7% |
| (space) | 1503773 | 12.2% |
| T | 847317 | 6.9% |
| A | 680093 | 5.5% |
| R | 632291 | 5.1% |
| 1 | 625603 | 5.1% |
| 0 | 548667 | 4.5% |
| S | 538560 | 4.4% |
| N | 506195 | 4.1% |
| 2 | 422115 | 3.4% |
| Other values (29) | 4454317 | 36.1% |
zipcode
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 191 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10916.246 |
| Minimum | 83 |
|---|---|
| Maximum | 11697 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 83 |
|---|---|
| 5-th percentile | 10025 |
| Q1 | 10451 |
| median | 11214 |
| Q3 | 11365 |
| 95-th percentile | 11432 |
| Maximum | 11697 |
| Range | 11614 |
| Interquartile range (IQR) | 914 |
Descriptive statistics
| Standard deviation | 651.55336 |
|---|---|
| Coefficient of variation (CV) | 0.059686577 |
| Kurtosis | 102.1138 |
| Mean | 10916.246 |
| Median Absolute Deviation (MAD) | 203 |
| Skewness | -6.5077556 |
| Sum | 7.464398 × 10⁹ |
| Variance | 424521.79 |
| Monotonicity | Not monotonic |
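Several of the descriptive statistics above follow directly from the others. A short sketch that re-derives them from the reported zipcode values; the variable names are illustrative, and small differences are expected because the report rounds its inputs:

```python
import math

# Values copied from the zipcode statistics tables.
n_rows = 683788
mean = 10916.246
std = 651.55336
minimum, maximum = 83, 11697
q1, q3 = 10451, 11365

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance
total = mean * n_rows            # Sum

print(value_range, iqr, cv, variance, total)
```

The minimum of 83 lies far below the 5th percentile of 10025, which is what drives the large kurtosis and negative skewness; whether that value is a data-entry artefact is not something the report states.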
| Value | Count | Frequency (%) |
| 10312 | 22186 | 3.2% |
| 10314 | 16905 | 2.5% |
| 10306 | 13030 | 1.9% |
| 10309 | 12650 | 1.8% |
| 11234 | 11253 | 1.6% |
| 11385 | 10937 | 1.6% |
| 11357 | 9449 | 1.4% |
| 11207 | 8634 | 1.3% |
| 11434 | 8274 | 1.2% |
| 11208 | 8245 | 1.2% |
| Other values (181) | 562225 | 82.2% |
| Value | Count | Frequency (%) |
| 83 | 935 | 0.1% |
| 10001 | 911 | 0.1% |
| 10002 | 2265 | 0.3% |
| 10003 | 2025 | 0.3% |
| 10004 | 118 | < 0.1% |
| 10005 | 144 | < 0.1% |
| 10006 | 53 | < 0.1% |
| 10007 | 355 | 0.1% |
| 10009 | 1924 | 0.3% |
| 10010 | 889 | 0.1% |
| Value | Count | Frequency (%) |
| 11697 | 30 | < 0.1% |
| 11694 | 3572 | 0.5% |
| 11693 | 1169 | 0.2% |
| 11692 | 2013 | 0.3% |
| 11691 | 5718 | 0.8% |
| 11451 | 12 | < 0.1% |
| 11436 | 2407 | 0.4% |
| 11435 | 4595 | 0.7% |
| 11434 | 8274 | 1.2% |
| 11433 | 3745 | 0.5% |
zip_city
Categorical
HIGH CORRELATION 
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| Brooklyn | 177300 |
|---|---|
| Staten Island | 105318 |
| Bronx | 85203 |
| New York | 64488 |
| Jamaica | 26028 |
| Other values (43) | 225451 |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 9.3159605 |
| Min length | 5 |
Characters and Unicode
| Total characters | 6370142 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fresh Meadows |
|---|---|
| 2nd row | East Elmhurst |
| 3rd row | Jamaica |
| 4th row | Brooklyn |
| 5th row | Arverne |
Common Values
| Value | Count | Frequency (%) |
| Brooklyn | 177300 | 25.9% |
| Staten Island | 105318 | 15.4% |
| Bronx | 85203 | 12.5% |
| New York | 64488 | 9.4% |
| Jamaica | 26028 | 3.8% |
| Flushing | 23389 | 3.4% |
| Ridgewood | 10937 | 1.6% |
| Fresh Meadows | 10441 | 1.5% |
| Queens Village | 10127 | 1.5% |
| Astoria | 10007 | 1.5% |
| Other values (38) | 160550 | 23.5% |
| Value | Count | Frequency (%) |
| brooklyn | 177300 | 17.9% |
| island | 108797 | 11.0% |
| staten | 105318 | 10.6% |
| bronx | 85203 | 8.6% |
| new | 65353 | 6.6% |
| york | 64488 | 6.5% |
| jamaica | 26028 | 2.6% |
| flushing | 23389 | 2.4% |
| park | 20945 | 2.1% |
| gardens | 16267 | 1.6% |
| Other values (52) | 296328 | 29.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 662058 | 10.4% |
| n | 604433 | 9.5% |
| a | 474555 | 7.4% |
| l | 446198 | 7.0% |
| r | 440622 | 6.9% |
| e | 400969 | 6.3% |
| t | 309410 | 4.9% |
| (space) | 305628 | 4.8% |
| k | 294951 | 4.6% |
| s | 283899 | 4.5% |
| Other values (36) | 2147419 | 33.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5075098 | 79.7% |
| Uppercase Letter | 989416 | 15.5% |
| Space Separator | 305628 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 662058 | 13.0% |
| n | 604433 | 11.9% |
| a | 474555 | 9.4% |
| l | 446198 | 8.8% |
| r | 440622 | 8.7% |
| e | 400969 | 7.9% |
| t | 309410 | 6.1% |
| k | 294951 | 5.8% |
| s | 283899 | 5.6% |
| d | 224233 | 4.4% |
| Other values (14) | 933770 | 18.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 280983 | 28.4% |
| S | 127448 | 12.9% |
| I | 108806 | 11.0% |
| N | 72633 | 7.3% |
| Y | 64488 | 6.5% |
| F | 49315 | 5.0% |
| R | 37000 | 3.7% |
| J | 29323 | 3.0% |
| H | 28885 | 2.9% |
| P | 24074 | 2.4% |
| Other values (11) | 166461 | 16.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 305628 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6064514 | 95.2% |
| Common | 305628 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 662058 | 10.9% |
| n | 604433 | 10.0% |
| a | 474555 | 7.8% |
| l | 446198 | 7.4% |
| r | 440622 | 7.3% |
| e | 400969 | 6.6% |
| t | 309410 | 5.1% |
| k | 294951 | 4.9% |
| s | 283899 | 4.7% |
| B | 280983 | 4.6% |
| Other values (35) | 1866436 | 30.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 305628 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6370142 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 662058 | 10.4% |
| n | 604433 | 9.5% |
| a | 474555 | 7.4% |
| l | 446198 | 7.0% |
| r | 440622 | 6.9% |
| e | 400969 | 6.3% |
| t | 309410 | 4.9% |
| (space) | 305628 | 4.8% |
| k | 294951 | 4.6% |
| s | 283899 | 4.5% |
| Other values (36) | 2147419 | 33.7% |
cb_num
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 343.5054 |
| Minimum | 101 |
|---|---|
| Maximum | 503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 108 |
| Q1 | 302 |
| median | 402 |
| Q3 | 412 |
| 95-th percentile | 503 |
| Maximum | 503 |
| Range | 402 |
| Interquartile range (IQR) | 110 |
Descriptive statistics
| Standard deviation | 115.7406 |
|---|---|
| Coefficient of variation (CV) | 0.33693968 |
| Kurtosis | -0.51111783 |
| Mean | 343.5054 |
| Median Absolute Deviation (MAD) | 92 |
| Skewness | -0.56103898 |
| Sum | 2.3488487 × 10⁸ |
| Variance | 13395.887 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 503 | 53934 | 7.9% |
| 413 | 37017 | 5.4% |
| 407 | 30620 | 4.5% |
| 411 | 28071 | 4.1% |
| 412 | 26379 | 3.9% |
| 502 | 25717 | 3.8% |
| 501 | 25667 | 3.8% |
| 408 | 20383 | 3.0% |
| 405 | 19550 | 2.9% |
| 318 | 19319 | 2.8% |
| Other values (49) | 397131 | 58.1% |
| Value | Count | Frequency (%) |
| 101 | 2397 | 0.4% |
| 102 | 5019 | 0.7% |
| 103 | 4939 | 0.7% |
| 104 | 4704 | 0.7% |
| 105 | 2156 | 0.3% |
| 106 | 5061 | 0.7% |
| 107 | 8814 | 1.3% |
| 108 | 9269 | 1.4% |
| 109 | 4987 | 0.7% |
| 110 | 5962 | 0.9% |
| Value | Count | Frequency (%) |
| 503 | 53934 | 7.9% |
| 502 | 25717 | 3.8% |
| 501 | 25667 | 3.8% |
| 414 | 12412 | 1.8% |
| 413 | 37017 | 5.4% |
| 412 | 26379 | 3.9% |
| 411 | 28071 | 4.1% |
| 410 | 15224 | 2.2% |
| 409 | 11481 | 1.7% |
| 408 | 20383 | 3.0% |
borocode
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| 4 | 250551 |
|---|---|
| 3 | 177293 |
| 5 | 105318 |
| 2 | 85203 |
| 1 | 65423 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 683788 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 250551 | 36.6% |
| 3 | 177293 | 25.9% |
| 5 | 105318 | 15.4% |
| 2 | 85203 | 12.5% |
| 1 | 65423 | 9.6% |
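One way to read the high correlation between cb_num and borocode: if cb_num carries the borough code in its hundreds digit (an assumption, since the report does not document the encoding), then the counts for 501, 502 and 503 in the cb_num table should add up to the count for borocode 5. A minimal check with the reported counts:

```python
# Counts copied from the cb_num value table (the only codes at or above 500).
cb_counts = {501: 25667, 502: 25717, 503: 53934}

# Count copied from the borocode "Common Values" table.
borocode_counts = {5: 105318}

# Assumption: the hundreds digit of cb_num is the borough code.
by_boro = {}
for cb_num, count in cb_counts.items():
    boro = cb_num // 100
    by_boro[boro] = by_boro.get(boro, 0) + count

print(by_boro == borocode_counts)
```

If the encoding holds, cb_num is largely redundant with borocode for borough-level analysis, which would explain the correlation alert.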
| Value | Count | Frequency (%) |
| 4 | 250551 | 36.6% |
| 3 | 177293 | 25.9% |
| 5 | 105318 | 15.4% |
| 2 | 85203 | 12.5% |
| 1 | 65423 | 9.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 250551 | 36.6% |
| 3 | 177293 | 25.9% |
| 5 | 105318 | 15.4% |
| 2 | 85203 | 12.5% |
| 1 | 65423 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 683788 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 250551 | 36.6% |
| 3 | 177293 | 25.9% |
| 5 | 105318 | 15.4% |
| 2 | 85203 | 12.5% |
| 1 | 65423 | 9.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 683788 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 250551 | 36.6% |
| 3 | 177293 | 25.9% |
| 5 | 105318 | 15.4% |
| 2 | 85203 | 12.5% |
| 1 | 65423 | 9.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 683788 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 250551 | 36.6% |
| 3 | 177293 | 25.9% |
| 5 | 105318 | 15.4% |
| 2 | 85203 | 12.5% |
| 1 | 65423 | 9.6% |
boroname
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| Queens | 250551 |
|---|---|
| Brooklyn | 177293 |
| Staten Island | 105318 |
| Bronx | 85203 |
| Manhattan | 65423 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.7591388 |
| Min length | 5 |
Characters and Unicode
| Total characters | 5305606 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Queens |
|---|---|
| 2nd row | Queens |
| 3rd row | Queens |
| 4th row | Brooklyn |
| 5th row | Queens |
Common Values
| Value | Count | Frequency (%) |
| Queens | 250551 | 36.6% |
| Brooklyn | 177293 | 25.9% |
| Staten Island | 105318 | 15.4% |
| Bronx | 85203 | 12.5% |
| Manhattan | 65423 | 9.6% |
| Value | Count | Frequency (%) |
| queens | 250551 | 31.8% |
| brooklyn | 177293 | 22.5% |
| staten | 105318 | 13.3% |
| island | 105318 | 13.3% |
| bronx | 85203 | 10.8% |
| manhattan | 65423 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 854529 | 16.1% |
| e | 606420 | 11.4% |
| o | 439789 | 8.3% |
| a | 406905 | 7.7% |
| s | 355869 | 6.7% |
| t | 341482 | 6.4% |
| l | 282611 | 5.3% |
| B | 262496 | 4.9% |
| r | 262496 | 4.9% |
| Q | 250551 | 4.7% |
| Other values (10) | 1242458 | 23.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4411182 | 83.1% |
| Uppercase Letter | 789106 | 14.9% |
| Space Separator | 105318 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 854529 | 19.4% |
| e | 606420 | 13.7% |
| o | 439789 | 10.0% |
| a | 406905 | 9.2% |
| s | 355869 | 8.1% |
| t | 341482 | 7.7% |
| l | 282611 | 6.4% |
| r | 262496 | 6.0% |
| u | 250551 | 5.7% |
| y | 177293 | 4.0% |
| Other values (4) | 433237 | 9.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 262496 | 33.3% |
| Q | 250551 | 31.8% |
| S | 105318 | 13.3% |
| I | 105318 | 13.3% |
| M | 65423 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 105318 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5200288 | 98.0% |
| Common | 105318 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 854529 | 16.4% |
| e | 606420 | 11.7% |
| o | 439789 | 8.5% |
| a | 406905 | 7.8% |
| s | 355869 | 6.8% |
| t | 341482 | 6.6% |
| l | 282611 | 5.4% |
| B | 262496 | 5.0% |
| r | 262496 | 5.0% |
| Q | 250551 | 4.8% |
| Other values (9) | 1137140 | 21.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 105318 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5305606 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 854529 | 16.1% |
| e | 606420 | 11.4% |
| o | 439789 | 8.3% |
| a | 406905 | 7.7% |
| s | 355869 | 6.7% |
| t | 341482 | 6.4% |
| l | 282611 | 5.3% |
| B | 262496 | 4.9% |
| r | 262496 | 4.9% |
| Q | 250551 | 4.7% |
| Other values (10) | 1242458 | 23.4% |
cncldist
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.943181 |
| Minimum | 1 |
|---|---|
| Maximum | 51 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 19 |
| median | 30 |
| Q3 | 43 |
| 95-th percentile | 51 |
| Maximum | 51 |
| Range | 50 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 14.328531 |
|---|---|
| Coefficient of variation (CV) | 0.47852401 |
| Kurtosis | -1.0505651 |
| Mean | 29.943181 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.10899332 |
| Sum | 20474788 |
| Variance | 205.30681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 51 | 51236 | 7.5% |
| 19 | 34429 | 5.0% |
| 50 | 33035 | 4.8% |
| 23 | 30743 | 4.5% |
| 31 | 23161 | 3.4% |
| 49 | 21047 | 3.1% |
| 27 | 20116 | 2.9% |
| 32 | 19508 | 2.9% |
| 24 | 18993 | 2.8% |
| 30 | 18551 | 2.7% |
| Other values (41) | 412969 | 60.4% |
| Value | Count | Frequency (%) |
| 1 | 5694 | 0.8% |
| 2 | 5564 | 0.8% |
| 3 | 8631 | 1.3% |
| 4 | 8521 | 1.2% |
| 5 | 4982 | 0.7% |
| 6 | 8050 | 1.2% |
| 7 | 6572 | 1.0% |
| 8 | 7293 | 1.1% |
| 9 | 8213 | 1.2% |
| 10 | 6501 | 1.0% |
| Value | Count | Frequency (%) |
| 51 | 51236 | 7.5% |
| 50 | 33035 | 4.8% |
| 49 | 21047 | 3.1% |
| 48 | 11786 | 1.7% |
| 47 | 9259 | 1.4% |
| 46 | 16913 | 2.5% |
| 45 | 11758 | 1.7% |
| 44 | 11659 | 1.7% |
| 43 | 13196 | 1.9% |
| 42 | 13117 | 1.9% |
st_assem
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 65 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.791583 |
| Minimum | 23 |
|---|---|
| Maximum | 87 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 33 |
| median | 52 |
| Q3 | 64 |
| 95-th percentile | 83 |
| Maximum | 87 |
| Range | 64 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 18.96652 |
|---|---|
| Coefficient of variation (CV) | 0.37341856 |
| Kurtosis | -1.1675283 |
| Mean | 50.791583 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.15900702 |
| Sum | 34730675 |
| Variance | 359.72888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 46002 | 6.7% |
| 26 | 27763 | 4.1% |
| 64 | 22922 | 3.4% |
| 63 | 21883 | 3.2% |
| 25 | 21514 | 3.1% |
| 33 | 20927 | 3.1% |
| 61 | 17669 | 2.6% |
| 24 | 17643 | 2.6% |
| 23 | 17615 | 2.6% |
| 29 | 17572 | 2.6% |
| Other values (55) | 452278 | 66.1% |
| Value | Count | Frequency (%) |
| 23 | 17615 | 2.6% |
| 24 | 17643 | 2.6% |
| 25 | 21514 | 3.1% |
| 26 | 27763 | 4.1% |
| 27 | 14853 | 2.2% |
| 28 | 13789 | 2.0% |
| 29 | 17572 | 2.6% |
| 30 | 11689 | 1.7% |
| 31 | 14451 | 2.1% |
| 32 | 13443 | 2.0% |
| Value | Count | Frequency (%) |
| 87 | 7428 | 1.1% |
| 86 | 5530 | 0.8% |
| 85 | 6543 | 1.0% |
| 84 | 8253 | 1.2% |
| 83 | 8808 | 1.3% |
| 82 | 12887 | 1.9% |
| 81 | 8537 | 1.2% |
| 80 | 9314 | 1.4% |
| 79 | 7083 | 1.0% |
| 78 | 5182 | 0.8% |
st_senate
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.615781 |
| Minimum | 10 |
|---|---|
| Maximum | 36 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 14 |
| median | 21 |
| Q3 | 25 |
| 95-th percentile | 34 |
| Maximum | 36 |
| Range | 26 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 7.3908438 |
|---|---|
| Coefficient of variation (CV) | 0.35850418 |
| Kurtosis | -0.94839788 |
| Mean | 20.615781 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.29937187 |
| Sum | 14096824 |
| Variance | 54.624573 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 87241 | 12.8% |
| 11 | 67705 | 9.9% |
| 15 | 44624 | 6.5% |
| 14 | 38351 | 5.6% |
| 10 | 35044 | 5.1% |
| 34 | 31066 | 4.5% |
| 19 | 29083 | 4.3% |
| 22 | 27179 | 4.0% |
| 23 | 25401 | 3.7% |
| 16 | 24146 | 3.5% |
| Other values (16) | 273948 | 40.1% |
| Value | Count | Frequency (%) |
| 10 | 35044 | 5.1% |
| 11 | 67705 | 9.9% |
| 12 | 21847 | 3.2% |
| 13 | 18827 | 2.8% |
| 14 | 38351 | 5.6% |
| 15 | 44624 | 6.5% |
| 16 | 24146 | 3.5% |
| 17 | 22225 | 3.3% |
| 18 | 20603 | 3.0% |
| 19 | 29083 | 4.3% |
| Value | Count | Frequency (%) |
| 36 | 14544 | 2.1% |
| 34 | 31066 | 4.5% |
| 33 | 14462 | 2.1% |
| 32 | 15915 | 2.3% |
| 31 | 12898 | 1.9% |
| 30 | 13929 | 2.0% |
| 29 | 14285 | 2.1% |
| 28 | 13195 | 1.9% |
| 27 | 13685 | 2.0% |
| 26 | 16456 | 2.4% |
nta
Text
| Distinct | 188 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2735152 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | QN37 |
|---|---|
| 2nd row | QN28 |
| 3rd row | QN76 |
| 4th row | BK31 |
| 5th row | QN12 |
| Value | Count | Frequency (%) |
| si01 | 12969 | 1.9% |
| si54 | 10734 | 1.6% |
| qn46 | 9780 | 1.4% |
| bk82 | 9607 | 1.4% |
| si32 | 9251 | 1.4% |
| si05 | 8446 | 1.2% |
| si11 | 8216 | 1.2% |
| qn17 | 7701 | 1.1% |
| qn49 | 7620 | 1.1% |
| bk45 | 7449 | 1.1% |
| Other values (178) | 592015 | 86.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 316193 | 11.6% |
| B | 262277 | 9.6% |
| Q | 250524 | 9.2% |
| 3 | 187685 | 6.9% |
| 2 | 182588 | 6.7% |
| K | 177320 | 6.5% |
| 4 | 174978 | 6.4% |
| 1 | 167614 | 6.1% |
| 5 | 160168 | 5.9% |
| 0 | 138820 | 5.1% |
| Other values (8) | 716985 | 26.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1367576 | 50.0% |
| Decimal Number | 1367576 | 50.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 187685 | 13.7% |
| 2 | 182588 | 13.4% |
| 4 | 174978 | 12.8% |
| 1 | 167614 | 12.3% |
| 5 | 160168 | 11.7% |
| 0 | 138820 | 10.2% |
| 7 | 104302 | 7.6% |
| 6 | 97530 | 7.1% |
| 8 | 97514 | 7.1% |
| 9 | 56377 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 316193 | 23.1% |
| B | 262277 | 19.2% |
| Q | 250524 | 18.3% |
| K | 177320 | 13.0% |
| S | 105318 | 7.7% |
| I | 105318 | 7.7% |
| X | 84957 | 6.2% |
| M | 65669 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1367576 | 50.0% |
| Common | 1367576 | 50.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 187685 | 13.7% |
| 2 | 182588 | 13.4% |
| 4 | 174978 | 12.8% |
| 1 | 167614 | 12.3% |
| 5 | 160168 | 11.7% |
| 0 | 138820 | 10.2% |
| 7 | 104302 | 7.6% |
| 6 | 97530 | 7.1% |
| 8 | 97514 | 7.1% |
| 9 | 56377 | 4.1% |
Latin
| Value | Count | Frequency (%) |
| N | 316193 | 23.1% |
| B | 262277 | 19.2% |
| Q | 250524 | 18.3% |
| K | 177320 | 13.0% |
| S | 105318 | 7.7% |
| I | 105318 | 7.7% |
| X | 84957 | 6.2% |
| M | 65669 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2735152 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 316193 | 11.6% |
| B | 262277 | 9.6% |
| Q | 250524 | 9.2% |
| 3 | 187685 | 6.9% |
| 2 | 182588 | 6.7% |
| K | 177320 | 6.5% |
| 4 | 174978 | 6.4% |
| 1 | 167614 | 6.1% |
| 5 | 160168 | 5.9% |
| 0 | 138820 | 5.1% |
| Other values (8) | 716985 | 26.2% |
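The letter counts for nta are consistent with every code starting with a two-letter borough prefix. The sketch below checks that reading; the prefixes QN, BK and SI appear in the sample and word tables, while BX and MN are assumptions inferred from the letters X and M:

```python
# Uppercase letter counts copied from the nta "Uppercase Letter" table.
letters = {
    "N": 316193, "B": 262277, "Q": 250524, "K": 177320,
    "S": 105318, "I": 105318, "X": 84957, "M": 65669,
}

# Rows per assumed prefix, read off the letter that is unique to it.
# BX and MN are hypothetical; they do not appear verbatim in the report.
prefix_rows = {
    "QN": letters["Q"],
    "BK": letters["K"],
    "SI": letters["S"],
    "BX": letters["X"],
    "MN": letters["M"],
}

# Letters shared by two prefixes should equal the sum of both.
checks = {
    "N": prefix_rows["QN"] + prefix_rows["MN"],
    "B": prefix_rows["BK"] + prefix_rows["BX"],
    "I": prefix_rows["SI"],
}
consistent = all(letters[k] == v for k, v in checks.items())
total_rows = sum(prefix_rows.values())

print(consistent, total_rows)
```

Under this reading the per-prefix counts differ slightly from the boroname counts (for example 250524 against 250551 for Queens), so a small number of rows may carry an nta prefix that does not match their borough.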
nta_name
Text
| Distinct | 188 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 39 |
| Mean length | 19.836797 |
| Min length | 6 |
Characters and Unicode
| Total characters | 13564164 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Kew Gardens Hills |
|---|---|
| 2nd row | Jackson Heights |
| 3rd row | Baisley Park |
| 4th row | Bay Ridge |
| 5th row | Hammels-Arverne-Edgemere |
| Value | Count | Frequency (%) |
| park | 55089 | 3.7% |
| east | 49319 | 3.3% |
| heights | 48397 | 3.2% |
| hill | 34811 | 2.3% |
| new | 34005 | 2.3% |
| south | 30523 | 2.0% |
| north | 29852 | 2.0% |
| beach | 29126 | 2.0% |
| hills | 28352 | 1.9% |
| village | 26269 | 1.8% |
| Other values (250) | 1126691 | 75.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1215359 | 9.0% |
| a | 963637 | 7.1% |
| l | 880073 | 6.5% |
| o | 853763 | 6.3% |
| (space) | 808646 | 6.0% |
| r | 804239 | 5.9% |
| i | 767966 | 5.7% |
| n | 738160 | 5.4% |
| t | 732792 | 5.4% |
| s | 708887 | 5.2% |
| Other values (45) | 5090642 | 37.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10289513 | 75.9% |
| Uppercase Letter | 1967877 | 14.5% |
| Space Separator | 808646 | 6.0% |
| Dash Punctuation | 459153 | 3.4% |
| Other Punctuation | 32993 | 0.2% |
| Open Punctuation | 2991 | < 0.1% |
| Close Punctuation | 2991 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1215359 | 11.8% |
| a | 963637 | 9.4% |
| l | 880073 | 8.6% |
| o | 853763 | 8.3% |
| r | 804239 | 7.8% |
| i | 767966 | 7.5% |
| n | 738160 | 7.2% |
| t | 732792 | 7.1% |
| s | 708887 | 6.9% |
| d | 403036 | 3.9% |
| Other values (15) | 2221601 | 21.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 254261 | 12.9% |
| B | 226733 | 11.5% |
| S | 160874 | 8.2% |
| P | 154900 | 7.9% |
| C | 127859 | 6.5% |
| M | 108390 | 5.5% |
| G | 96399 | 4.9% |
| E | 95949 | 4.9% |
| N | 95605 | 4.9% |
| W | 94221 | 4.8% |
| Other values (14) | 552686 | 28.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 16745 | 50.8% |
| . | 16248 | 49.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 808646 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 459153 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2991 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2991 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12257390 | 90.4% |
| Common | 1306774 | 9.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1215359 | 9.9% |
| a | 963637 | 7.9% |
| l | 880073 | 7.2% |
| o | 853763 | 7.0% |
| r | 804239 | 6.6% |
| i | 767966 | 6.3% |
| n | 738160 | 6.0% |
| t | 732792 | 6.0% |
| s | 708887 | 5.8% |
| d | 403036 | 3.3% |
| Other values (39) | 4189478 | 34.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 808646 | 61.9% |
| - | 459153 | 35.1% |
| ' | 16745 | 1.3% |
| . | 16248 | 1.2% |
| ( | 2991 | 0.2% |
| ) | 2991 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13564164 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1215359 | 9.0% |
| a | 963637 | 7.1% |
| l | 880073 | 6.5% |
| o | 853763 | 6.3% |
| (space) | 808646 | 6.0% |
| r | 804239 | 5.9% |
| i | 767966 | 5.7% |
| n | 738160 | 5.4% |
| t | 732792 | 5.4% |
| s | 708887 | 5.2% |
| Other values (45) | 5090642 | 37.5% |
boro_ct
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2152 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3404914.1 |
| Minimum | 1000201 |
| Maximum | 5032300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 1000201 |
|---|---|
| 5-th percentile | 1015700 |
| Q1 | 3011700 |
| median | 4008100 |
| Q3 | 4103202 |
| 95-th percentile | 5019800 |
| Maximum | 5032300 |
| Range | 4032099 |
| Interquartile range (IQR) | 1091502 |
Descriptive statistics
| Standard deviation | 1175863.4 |
|---|---|
| Coefficient of variation (CV) | 0.34534305 |
| Kurtosis | -0.53896426 |
| Mean | 3404914.1 |
| Median Absolute Deviation (MAD) | 964100 |
| Skewness | -0.55026943 |
| Sum | 2.3282394 × 10¹² |
| Variance | 1.3826548 × 10¹² |
| Monotonicity | Not monotonic |
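The derived figures in the quantile and descriptive tables above follow directly from the reported summary values. As a minimal sketch in plain Python, using only numbers copied from this report (the variable names are illustrative, not part of the report), the range, interquartile range, and coefficient of variation for boro_ct can be re-derived like this:

```python
# Values copied from the boro_ct tables in this report.
minimum, maximum = 1000201, 5032300
q1, q3 = 3011700, 4103202
mean, std = 3404914.1, 1175863.4

value_range = maximum - minimum  # "Range" row
iqr = q3 - q1                    # "Interquartile range (IQR)" row
cv = std / mean                  # "Coefficient of variation (CV)" row

print(value_range, iqr, round(cv, 4))
```

Because the mean and standard deviation are printed in rounded form, the recomputed CV agrees with the reported 0.34534305 only to the precision of those inputs.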
| Value | Count | Frequency (%) |
| 5020801 | 3776 | 0.6% |
| 5017600 | 3245 | 0.5% |
| 5020803 | 2749 | 0.4% |
| 5020804 | 2723 | 0.4% |
| 5019800 | 2620 | 0.4% |
| 5022600 | 2452 | 0.4% |
| 5017005 | 2402 | 0.4% |
| 4089200 | 2399 | 0.4% |
| 5024401 | 2392 | 0.3% |
| 5017010 | 2269 | 0.3% |
| Other values (2142) | 656761 | 96.0% |
| Value | Count | Frequency (%) |
| 1000201 | 70 | < 0.1% |
| 1000202 | 217 | < 0.1% |
| 1000600 | 187 | < 0.1% |
| 1000700 | 144 | < 0.1% |
| 1000800 | 288 | < 0.1% |
| 1000900 | 83 | < 0.1% |
| 1001001 | 24 | < 0.1% |
| 1001002 | 56 | < 0.1% |
| 1001200 | 111 | < 0.1% |
| 1001300 | 94 | < 0.1% |
| Value | Count | Frequency (%) |
| 5032300 | 114 | < 0.1% |
| 5031902 | 511 | 0.1% |
| 5031901 | 393 | 0.1% |
| 5030302 | 769 | 0.1% |
| 5030301 | 572 | 0.1% |
| 5029104 | 1294 | 0.2% |
| 5029103 | 1636 | 0.2% |
| 5029102 | 752 | 0.1% |
| 5027900 | 416 | 0.1% |
| 5027706 | 529 | 0.1% |
state
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| New York |
|---|
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 5470304 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York |
|---|---|
| 2nd row | New York |
| 3rd row | New York |
| 4th row | New York |
| 5th row | New York |
Common Values
| Value | Count | Frequency (%) |
| New York | 683788 | 100.0% |
[Plots omitted: Length, Common Values (Plot)]
| Value | Count | Frequency (%) |
| new | 683788 | 50.0% |
| york | 683788 | 50.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 683788 | 12.5% |
| e | 683788 | 12.5% |
| w | 683788 | 12.5% |
| (space) | 683788 | 12.5% |
| Y | 683788 | 12.5% |
| o | 683788 | 12.5% |
| r | 683788 | 12.5% |
| k | 683788 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3418940 | 62.5% |
| Uppercase Letter | 1367576 | 25.0% |
| Space Separator | 683788 | 12.5% |
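The category counts in the table above can be checked by hand, since every row of this column holds the same string. The following is a minimal sketch using Python's standard `unicodedata` module. It is not the report generator's own code, and the names `value`, `n_rows`, and `totals` are illustrative. It counts the Unicode general category of each character in "New York" and scales by the number of observations:

```python
import unicodedata
from collections import Counter

value = "New York"  # the single value of the state column
n_rows = 683788     # number of observations in this report

# Count Unicode general categories per character (Lu = uppercase letter,
# Ll = lowercase letter, Zs = space separator), then scale by the row
# count because every row contains the same string.
per_string = Counter(unicodedata.category(ch) for ch in value)
totals = {cat: count * n_rows for cat, count in per_string.items()}

print(totals)
```

The totals for `Ll`, `Lu`, and `Zs` match the Lowercase Letter, Uppercase Letter, and Space Separator rows, and their sum matches the 5470304 total characters reported for this column.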
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 683788 | 20.0% |
| w | 683788 | 20.0% |
| o | 683788 | 20.0% |
| r | 683788 | 20.0% |
| k | 683788 | 20.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 683788 | 50.0% |
| Y | 683788 | 50.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 683788 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4786516 | 87.5% |
| Common | 683788 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 683788 | 14.3% |
| e | 683788 | 14.3% |
| w | 683788 | 14.3% |
| Y | 683788 | 14.3% |
| o | 683788 | 14.3% |
| r | 683788 | 14.3% |
| k | 683788 | 14.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 683788 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5470304 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 683788 | 12.5% |
| e | 683788 | 12.5% |
| w | 683788 | 12.5% |
| (space) | 683788 | 12.5% |
| Y | 683788 | 12.5% |
| o | 683788 | 12.5% |
| r | 683788 | 12.5% |
| k | 683788 | 12.5% |
latitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 676080 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.701261 |
| Minimum | 40.498466 |
| Maximum | 40.912918 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 40.498466 |
|---|---|
| 5-th percentile | 40.548447 |
| Q1 | 40.631928 |
| median | 40.700612 |
| Q3 | 40.762228 |
| 95-th percentile | 40.856407 |
| Maximum | 40.912918 |
| Range | 0.41445217 |
| Interquartile range (IQR) | 0.13029949 |
Descriptive statistics
| Standard deviation | 0.090311355 |
|---|---|
| Coefficient of variation (CV) | 0.0022188834 |
| Kurtosis | -0.63383411 |
| Mean | 40.701261 |
| Median Absolute Deviation (MAD) | 0.0650372 |
| Skewness | 0.062738079 |
| Sum | 27831034 |
| Variance | 0.0081561409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.66035938 | 35 | < 0.1% |
| 40.61097709 | 28 | < 0.1% |
| 40.61596593 | 17 | < 0.1% |
| 40.689461 | 17 | < 0.1% |
| 40.77849693 | 11 | < 0.1% |
| 40.66036234 | 9 | < 0.1% |
| 40.8830341 | 9 | < 0.1% |
| 40.85571674 | 8 | < 0.1% |
| 40.7715108 | 8 | < 0.1% |
| 40.78750377 | 7 | < 0.1% |
| Other values (676070) | 683639 | > 99.9% |
| Value | Count | Frequency (%) |
| 40.49846614 | 1 | < 0.1% |
| 40.49847126 | 1 | < 0.1% |
| 40.49850958 | 1 | < 0.1% |
| 40.49854295 | 1 | < 0.1% |
| 40.4985987 | 1 | < 0.1% |
| 40.49874956 | 1 | < 0.1% |
| 40.49879352 | 1 | < 0.1% |
| 40.49881226 | 1 | < 0.1% |
| 40.49881678 | 1 | < 0.1% |
| 40.49887148 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.91291831 | 1 | < 0.1% |
| 40.91280676 | 1 | < 0.1% |
| 40.91271785 | 1 | < 0.1% |
| 40.91261439 | 1 | < 0.1% |
| 40.91260541 | 1 | < 0.1% |
| 40.9124346 | 1 | < 0.1% |
| 40.91236829 | 1 | < 0.1% |
| 40.91220869 | 1 | < 0.1% |
| 40.91217396 | 1 | < 0.1% |
| 40.91215184 | 1 | < 0.1% |
longitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 677101 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.92406 |
| Minimum | -74.254965 |
| Maximum | -73.700488 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 683788 |
| Negative (%) | 100.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | -74.254965 |
|---|---|
| 5-th percentile | -74.169931 |
| Q1 | -73.9805 |
| median | -73.912911 |
| Q3 | -73.83491 |
| 95-th percentile | -73.744469 |
| Maximum | -73.700488 |
| Range | 0.55447653 |
| Interquartile range (IQR) | 0.1455898 |
Descriptive statistics
| Standard deviation | 0.12358346 |
|---|---|
| Coefficient of variation (CV) | -0.0016717623 |
| Kurtosis | -0.11879602 |
| Mean | -73.92406 |
| Median Absolute Deviation (MAD) | 0.071501375 |
| Skewness | -0.61207924 |
| Sum | -50548385 |
| Variance | 0.015272871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.76103529 | 35 | < 0.1% |
| -74.15686098 | 29 | < 0.1% |
| -73.75274984 | 17 | < 0.1% |
| -73.95680062 | 17 | < 0.1% |
| -73.79764749 | 11 | < 0.1% |
| -73.90045375 | 9 | < 0.1% |
| -73.76106931 | 9 | < 0.1% |
| -73.91441448 | 8 | < 0.1% |
| -73.91666671 | 8 | < 0.1% |
| -73.85143651 | 7 | < 0.1% |
| Other values (677091) | 683638 | > 99.9% |
| Value | Count | Frequency (%) |
| -74.2549647 | 1 | < 0.1% |
| -74.25489452 | 1 | < 0.1% |
| -74.25487627 | 1 | < 0.1% |
| -74.25485662 | 1 | < 0.1% |
| -74.25483416 | 1 | < 0.1% |
| -74.25479222 | 1 | < 0.1% |
| -74.25477504 | 1 | < 0.1% |
| -74.2547486 | 1 | < 0.1% |
| -74.25472217 | 1 | < 0.1% |
| -74.25469573 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -73.70048817 | 1 | < 0.1% |
| -73.70059248 | 1 | < 0.1% |
| -73.70059368 | 1 | < 0.1% |
| -73.70059651 | 1 | < 0.1% |
| -73.70060124 | 1 | < 0.1% |
| -73.7006054 | 1 | < 0.1% |
| -73.70060622 | 1 | < 0.1% |
| -73.70061034 | 1 | < 0.1% |
| -73.70062065 | 1 | < 0.1% |
| -73.70064005 | 1 | < 0.1% |
x_sp
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 681630 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005279.9 |
| Minimum | 913349.27 |
| Maximum | 1067247.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 913349.27 |
|---|---|
| 5-th percentile | 937034.4 |
| Q1 | 989657.84 |
| median | 1008386.2 |
| Q3 | 1029991.3 |
| 95-th percentile | 1055117.1 |
| Maximum | 1067247.6 |
| Range | 153898.36 |
| Interquartile range (IQR) | 40333.437 |
Descriptive statistics
| Standard deviation | 34285.054 |
|---|---|
| Coefficient of variation (CV) | 0.034104985 |
| Kurtosis | -0.11379428 |
| Mean | 1005279.9 |
| Median Absolute Deviation (MAD) | 19812.179 |
| Skewness | -0.61435668 |
| Sum | 6.8739831 × 10¹¹ |
| Variance | 1.175465 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1050549.648 | 35 | < 0.1% |
| 940697.4014 | 28 | < 0.1% |
| 996243.4593 | 17 | < 0.1% |
| 1052818.477 | 17 | < 0.1% |
| 1040292.358 | 11 | < 0.1% |
| 1014131.162 | 9 | < 0.1% |
| 1050540.207 | 9 | < 0.1% |
| 1011776.478 | 9 | < 0.1% |
| 1007955.765 | 8 | < 0.1% |
| 1007302.748 | 8 | < 0.1% |
| Other values (681620) | 683637 | > 99.9% |
| Value | Count | Frequency (%) |
| 913349.2661 | 1 | < 0.1% |
| 913368.6477 | 1 | < 0.1% |
| 913373.6867 | 1 | < 0.1% |
| 913379.1135 | 1 | < 0.1% |
| 913385.3156 | 1 | < 0.1% |
| 913396.8953 | 1 | < 0.1% |
| 913401.6387 | 1 | < 0.1% |
| 913408.9363 | 1 | < 0.1% |
| 913416.2338 | 1 | < 0.1% |
| 913423.5311 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1067247.624 | 1 | < 0.1% |
| 1067220.126 | 1 | < 0.1% |
| 1067219.309 | 1 | < 0.1% |
| 1067218.901 | 1 | < 0.1% |
| 1067217.945 | 1 | < 0.1% |
| 1067216.745 | 1 | < 0.1% |
| 1067215.32 | 1 | < 0.1% |
| 1067214.942 | 1 | < 0.1% |
| 1067212.348 | 1 | < 0.1% |
| 1067206.751 | 1 | < 0.1% |
y_sp
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 682632 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 194798.42 |
| Minimum | 120973.79 |
| Maximum | 271894.09 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 120973.79 |
|---|---|
| 5-th percentile | 139145.45 |
| Q1 | 169515.15 |
| median | 194560.25 |
| Q3 | 217019.57 |
| 95-th percentile | 251311.16 |
| Maximum | 271894.09 |
| Range | 150920.3 |
| Interquartile range (IQR) | 47504.418 |
Descriptive statistics
| Standard deviation | 32902.061 |
|---|---|
| Coefficient of variation (CV) | 0.16890312 |
| Kurtosis | -0.63530824 |
| Mean | 194798.42 |
| Median Absolute Deviation (MAD) | 23708.383 |
| Skewness | 0.062752468 |
| Sum | 1.3320083 × 10¹¹ |
| Variance | 1.0825456 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 179953.5509 | 35 | < 0.1% |
| 161910.8114 | 28 | < 0.1% |
| 190562.4395 | 17 | < 0.1% |
| 163692.3397 | 17 | < 0.1% |
| 222968.9875 | 11 | < 0.1% |
| 261006.6421 | 9 | < 0.1% |
| 179954.6024 | 9 | < 0.1% |
| 251049.181 | 8 | < 0.1% |
| 220370.5573 | 8 | < 0.1% |
| 202587.9351 | 7 | < 0.1% |
| Other values (682622) | 683639 | > 99.9% |
| Value | Count | Frequency (%) |
| 120973.7922 | 1 | < 0.1% |
| 120974.9307 | 1 | < 0.1% |
| 120989.6301 | 1 | < 0.1% |
| 121001.0648 | 1 | < 0.1% |
| 121021.3909 | 1 | < 0.1% |
| 121077.124 | 1 | < 0.1% |
| 121093.1548 | 1 | < 0.1% |
| 121098.6109 | 1 | < 0.1% |
| 121100.9996 | 1 | < 0.1% |
| 121122.1262 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 271894.0921 | 1 | < 0.1% |
| 271853.4435 | 1 | < 0.1% |
| 271821.042 | 1 | < 0.1% |
| 271783.3394 | 1 | < 0.1% |
| 271780.3538 | 1 | < 0.1% |
| 271718.4015 | 1 | < 0.1% |
| 271694.1909 | 1 | < 0.1% |
| 271635.8193 | 1 | < 0.1% |
| 271623.217 | 1 | < 0.1% |
| 271615.2311 | 1 | < 0.1% |
| | block_id | boro_ct | borocode | boroname | brch_light | brch_other | brch_shoe | cb_num | cncldist | curb_loc | guards | health | latitude | longitude | root_grate | root_other | root_stone | sidewalk | st_assem | st_senate | status | steward | stump_diam | tree_dbh | trnk_light | trnk_other | trunk_wire | user_type | x_sp | y_sp | zip_city | zipcode | tree_id |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| block_id | 1.000 | 0.400 | 0.996 | 0.996 | 0.111 | 0.128 | 0.014 | 0.412 | 0.053 | 0.096 | 0.208 | 0.034 | 0.075 | 0.121 | 0.149 | 0.078 | 0.146 | 0.094 | 0.174 | 0.132 | 0.033 | 0.155 | 0.008 | -0.005 | 0.030 | 0.087 | 0.036 | 0.351 | 0.121 | 0.076 | 0.814 | -0.025 | 0.082 |
| boro_ct | 0.400 | 1.000 | 0.999 | 0.999 | 0.114 | 0.127 | 0.014 | 0.952 | 0.557 | 0.047 | 0.211 | 0.033 | -0.512 | 0.041 | 0.149 | 0.078 | 0.146 | 0.088 | -0.486 | -0.529 | 0.033 | 0.155 | 0.011 | 0.080 | 0.030 | 0.086 | 0.037 | 0.361 | 0.041 | -0.512 | 0.893 | 0.265 | 0.201 |
| borocode | 0.996 | 0.999 | 1.000 | 1.000 | 0.110 | 0.127 | 0.014 | 0.963 | 0.593 | 0.047 | 0.207 | 0.033 | -0.549 | -0.026 | 0.149 | 0.078 | 0.145 | 0.088 | -0.483 | -0.534 | 0.033 | 0.153 | 0.010 | 0.077 | 0.030 | 0.087 | 0.035 | 0.353 | -0.026 | -0.549 | 1.000 | 0.279 | 0.203 |
| boroname | 0.996 | 0.999 | 1.000 | 1.000 | 0.110 | 0.127 | 0.014 | 0.853 | 0.371 | 0.047 | 0.207 | 0.033 | -0.430 | -0.092 | 0.149 | 0.078 | 0.145 | 0.088 | -0.397 | -0.454 | 0.033 | 0.153 | 0.005 | 0.048 | 0.030 | 0.087 | 0.035 | 0.353 | -0.091 | -0.430 | 1.000 | 0.089 | 0.111 |
| brch_light | 0.111 | 0.114 | 0.110 | 0.110 | 1.000 | 0.003 | 0.009 | 0.067 | 0.029 | 0.042 | 0.054 | 0.025 | -0.012 | 0.061 | 0.011 | 0.058 | 0.143 | 0.110 | -0.061 | -0.074 | 0.070 | 0.038 | -0.052 | 0.174 | 0.063 | 0.049 | 0.175 | 0.063 | 0.062 | -0.012 | 0.169 | 0.087 | 0.011 |
| brch_other | 0.128 | 0.127 | 0.127 | 0.127 | 0.003 | 1.000 | 0.012 | -0.104 | -0.067 | 0.006 | 0.070 | 0.152 | 0.050 | -0.033 | 0.039 | 0.165 | 0.059 | 0.051 | 0.047 | 0.057 | 0.042 | 0.070 | -0.031 | 0.014 | 0.012 | 0.230 | 0.032 | 0.137 | -0.033 | 0.050 | 0.136 | -0.055 | -0.083 |
| brch_shoe | 0.014 | 0.014 | 0.014 | 0.014 | 0.009 | 0.012 | 1.000 | -0.016 | -0.008 | 0.002 | 0.002 | 0.007 | 0.009 | -0.003 | 0.003 | 0.009 | 0.017 | 0.009 | 0.009 | 0.008 | 0.005 | 0.000 | -0.004 | 0.011 | 0.004 | 0.014 | 0.004 | 0.012 | -0.003 | 0.009 | 0.017 | -0.005 | -0.009 |
| cb_num | 0.412 | 0.952 | 0.963 | 0.853 | 0.067 | -0.104 | -0.016 | 1.000 | 0.599 | 0.047 | 0.210 | 0.033 | -0.599 | 0.041 | 0.149 | 0.080 | 0.145 | 0.089 | -0.485 | -0.535 | 0.034 | 0.156 | 0.015 | 0.090 | 0.031 | 0.087 | 0.037 | 0.372 | 0.041 | -0.599 | 0.894 | 0.312 | 0.242 |
| cncldist | 0.053 | 0.557 | 0.593 | 0.371 | 0.029 | -0.067 | -0.008 | 0.599 | 1.000 | 0.054 | 0.216 | 0.037 | -0.929 | -0.519 | 0.178 | 0.086 | 0.136 | 0.081 | -0.108 | -0.124 | 0.030 | 0.170 | -0.004 | 0.043 | 0.029 | 0.093 | 0.043 | 0.381 | -0.518 | -0.929 | 0.719 | 0.018 | 0.135 |
| curb_loc | 0.096 | 0.047 | 0.047 | 0.047 | 0.042 | 0.006 | 0.002 | 0.047 | 0.054 | 1.000 | 0.035 | 0.006 | -0.009 | 0.014 | 0.010 | 0.014 | 0.032 | 0.066 | -0.025 | -0.030 | 0.009 | 0.021 | 0.008 | -0.044 | 0.000 | 0.000 | 0.015 | 0.026 | 0.014 | -0.009 | 0.094 | 0.029 | 0.020 |
| guards | 0.208 | 0.211 | 0.207 | 0.207 | 0.054 | 0.070 | 0.002 | 0.210 | 0.216 | 0.035 | 1.000 | 0.021 | -0.086 | 0.139 | 0.055 | 0.088 | 0.093 | 0.048 | -0.144 | -0.166 | 1.000 | 0.324 | NaN | 0.084 | 0.032 | 0.060 | 0.014 | 0.168 | 0.139 | -0.086 | 0.220 | 0.172 | 0.133 |
| health | 0.034 | 0.033 | 0.033 | 0.033 | 0.025 | 0.152 | 0.007 | 0.033 | 0.037 | 0.006 | 0.021 | 1.000 | 0.002 | 0.009 | 0.023 | 0.054 | 0.030 | 0.020 | -0.001 | -0.004 | 1.000 | 0.008 | NaN | -0.007 | 0.008 | 0.135 | 0.028 | 0.025 | 0.009 | 0.002 | 0.074 | 0.002 | 0.072 |
| latitude | 0.075 | -0.512 | -0.549 | -0.430 | -0.012 | 0.050 | 0.009 | -0.599 | -0.929 | -0.009 | -0.086 | 0.002 | 1.000 | 0.511 | 0.080 | 0.059 | 0.114 | 0.079 | 0.153 | 0.165 | 0.026 | 0.061 | 0.000 | -0.035 | 0.016 | 0.072 | 0.032 | 0.286 | 0.510 | 1.000 | 0.585 | -0.028 | -0.136 |
| longitude | 0.121 | 0.041 | -0.026 | -0.092 | 0.061 | -0.033 | -0.003 | 0.041 | -0.519 | 0.014 | 0.139 | 0.009 | 0.511 | 1.000 | 0.085 | 0.081 | 0.155 | 0.092 | -0.554 | -0.551 | 0.027 | 0.136 | 0.028 | 0.080 | 0.021 | 0.092 | 0.031 | 0.378 | 1.000 | 0.511 | 0.636 | 0.714 | 0.192 |
| root_grate | 0.149 | 0.149 | 0.149 | 0.149 | 0.011 | 0.039 | 0.003 | 0.149 | 0.178 | 0.010 | 0.055 | 0.023 | 0.080 | 0.085 | 1.000 | 0.023 | 0.015 | 0.001 | 0.064 | 0.060 | 0.016 | 0.016 | -0.012 | -0.008 | 0.035 | 0.037 | 0.007 | 0.061 | -0.040 | 0.049 | 0.153 | -0.078 | -0.045 |
| root_other | 0.078 | 0.078 | 0.078 | 0.078 | 0.058 | 0.165 | 0.009 | 0.080 | 0.086 | 0.014 | 0.088 | 0.054 | 0.059 | 0.081 | 0.023 | 1.000 | 0.056 | 0.088 | 0.020 | 0.031 | 0.047 | 0.023 | -0.035 | 0.089 | 0.018 | 0.204 | 0.050 | 0.104 | -0.017 | 0.039 | 0.101 | -0.021 | -0.068 |
| root_stone | 0.146 | 0.146 | 0.145 | 0.145 | 0.143 | 0.059 | 0.017 | 0.145 | 0.136 | 0.032 | 0.093 | 0.030 | 0.114 | 0.155 | 0.015 | 0.056 | 1.000 | 0.344 | -0.045 | -0.031 | 0.112 | 0.106 | -0.083 | 0.337 | 0.008 | 0.079 | 0.051 | 0.169 | 0.024 | 0.050 | 0.197 | 0.072 | -0.031 |
| sidewalk | 0.094 | 0.088 | 0.088 | 0.088 | 0.110 | 0.051 | 0.009 | 0.089 | 0.081 | 0.066 | 0.048 | 0.020 | 0.079 | 0.092 | 0.001 | 0.088 | 0.344 | 1.000 | 0.008 | 0.004 | 1.000 | 0.057 | NaN | -0.253 | 0.000 | 0.065 | 0.035 | 0.083 | 0.001 | -0.007 | 0.131 | -0.034 | 0.001 |
| st_assem | 0.174 | -0.486 | -0.483 | -0.397 | -0.061 | 0.047 | 0.009 | -0.485 | -0.108 | -0.025 | -0.144 | -0.001 | 0.153 | -0.554 | 0.064 | 0.020 | -0.045 | 0.008 | 1.000 | 0.912 | 0.037 | 0.120 | -0.032 | -0.129 | 0.027 | 0.078 | 0.042 | 0.329 | -0.554 | 0.152 | 0.670 | -0.838 | -0.219 |
| st_senate | 0.132 | -0.529 | -0.534 | -0.454 | -0.074 | 0.057 | 0.008 | -0.535 | -0.124 | -0.030 | -0.166 | -0.004 | 0.165 | -0.551 | 0.060 | 0.031 | -0.031 | 0.004 | 0.912 | 1.000 | 0.038 | 0.149 | -0.028 | -0.123 | 0.027 | 0.071 | 0.033 | 0.299 | -0.552 | 0.165 | 0.628 | -0.792 | -0.247 |
| status | 0.033 | 0.033 | 0.033 | 0.033 | 0.070 | 0.042 | 0.005 | 0.034 | 0.030 | 0.009 | 1.000 | 1.000 | 0.026 | 0.027 | 0.016 | 0.047 | 0.112 | 1.000 | 0.037 | 0.038 | 1.000 | 1.000 | 0.755 | -0.290 | 0.008 | 0.049 | 0.031 | 0.008 | 0.020 | 0.012 | 0.043 | 0.018 | -0.010 |
| steward | 0.155 | 0.155 | 0.153 | 0.153 | 0.038 | 0.070 | 0.000 | 0.156 | 0.170 | 0.021 | 0.324 | 0.008 | 0.061 | 0.136 | 0.016 | 0.023 | 0.106 | 0.057 | 0.120 | 0.149 | 1.000 | 1.000 | NaN | 0.246 | 0.034 | 0.036 | 0.013 | 0.152 | 0.124 | -0.033 | 0.172 | 0.119 | 0.141 |
| stump_diam | 0.008 | 0.011 | 0.010 | 0.005 | -0.052 | -0.031 | -0.004 | 0.015 | -0.004 | 0.008 | NaN | NaN | 0.000 | 0.028 | -0.012 | -0.035 | -0.083 | NaN | -0.032 | -0.028 | 0.755 | NaN | 1.000 | -0.275 | 0.003 | 0.025 | 0.016 | 0.013 | 0.028 | 0.000 | 0.021 | 0.037 | 0.002 |
| tree_dbh | -0.005 | 0.080 | 0.077 | 0.048 | 0.174 | 0.014 | 0.011 | 0.090 | 0.043 | -0.044 | 0.084 | -0.007 | -0.035 | 0.080 | -0.008 | 0.089 | 0.337 | -0.253 | -0.129 | -0.123 | -0.290 | 0.246 | -0.275 | 1.000 | 0.000 | 0.001 | 0.006 | 0.006 | 0.080 | -0.035 | 0.007 | 0.119 | 0.087 |
| trnk_light | 0.030 | 0.030 | 0.030 | 0.030 | 0.063 | 0.012 | 0.004 | 0.031 | 0.029 | 0.000 | 0.032 | 0.008 | 0.016 | 0.021 | 0.035 | 0.018 | 0.008 | 0.000 | 0.027 | 0.027 | 0.008 | 0.034 | 0.003 | 0.000 | 1.000 | 0.010 | 0.048 | 0.020 | -0.011 | 0.011 | 0.035 | -0.015 | -0.014 |
| trnk_other | 0.087 | 0.086 | 0.087 | 0.087 | 0.049 | 0.230 | 0.014 | 0.087 | 0.093 | 0.000 | 0.060 | 0.135 | 0.072 | 0.092 | 0.037 | 0.204 | 0.079 | 0.065 | 0.078 | 0.071 | 0.049 | 0.036 | 0.025 | 0.001 | 0.010 | 1.000 | 0.042 | 0.154 | -0.010 | 0.046 | 0.153 | -0.012 | -0.065 |
| trunk_wire | 0.036 | 0.037 | 0.035 | 0.035 | 0.175 | 0.032 | 0.004 | 0.037 | 0.043 | 0.015 | 0.014 | 0.028 | 0.032 | 0.031 | 0.007 | 0.050 | 0.051 | 0.035 | 0.042 | 0.033 | 0.031 | 0.013 | 0.016 | 0.006 | 0.048 | 0.042 | 1.000 | 0.048 | 0.011 | 0.002 | 0.051 | 0.020 | -0.010 |
| user_type | 0.351 | 0.361 | 0.353 | 0.353 | 0.063 | 0.137 | 0.012 | 0.372 | 0.381 | 0.026 | 0.168 | 0.025 | 0.286 | 0.378 | 0.061 | 0.104 | 0.169 | 0.083 | 0.329 | 0.299 | 0.008 | 0.152 | 0.013 | 0.006 | 0.020 | 0.154 | 0.048 | 1.000 | 0.049 | 0.240 | 0.450 | 0.100 | -0.258 |
| x_sp | 0.121 | 0.041 | -0.026 | -0.091 | 0.062 | -0.033 | -0.003 | 0.041 | -0.518 | 0.014 | 0.139 | 0.009 | 0.510 | 1.000 | -0.040 | -0.017 | 0.024 | 0.001 | -0.554 | -0.552 | 0.020 | 0.124 | 0.028 | 0.080 | -0.011 | -0.010 | 0.011 | 0.049 | 1.000 | 0.511 | 0.636 | 0.715 | 0.192 |
| y_sp | 0.076 | -0.512 | -0.549 | -0.430 | -0.012 | 0.050 | 0.009 | -0.599 | -0.929 | -0.009 | -0.086 | 0.002 | 1.000 | 0.511 | 0.049 | 0.039 | 0.050 | -0.007 | 0.152 | 0.165 | 0.012 | -0.033 | 0.000 | -0.035 | 0.011 | 0.046 | 0.002 | 0.240 | 0.511 | 1.000 | 0.585 | -0.027 | -0.136 |
| zip_city | 0.814 | 0.893 | 1.000 | 1.000 | 0.169 | 0.136 | 0.017 | 0.894 | 0.719 | 0.094 | 0.220 | 0.074 | 0.585 | 0.636 | 0.153 | 0.101 | 0.197 | 0.131 | 0.670 | 0.628 | 0.043 | 0.172 | 0.021 | 0.007 | 0.035 | 0.153 | 0.051 | 0.450 | 0.636 | 0.585 | 1.000 | -0.090 | 0.073 |
| zipcode | -0.025 | 0.265 | 0.279 | 0.089 | 0.087 | -0.055 | -0.005 | 0.312 | 0.018 | 0.029 | 0.172 | 0.002 | -0.028 | 0.714 | -0.078 | -0.021 | 0.072 | -0.034 | -0.838 | -0.792 | 0.018 | 0.119 | 0.037 | 0.119 | -0.015 | -0.012 | 0.020 | 0.100 | 0.715 | -0.027 | -0.090 | 1.000 | 0.267 |
| tree_id | 0.082 | 0.201 | 0.203 | 0.111 | 0.011 | -0.083 | -0.009 | 0.242 | 0.135 | 0.020 | 0.133 | 0.072 | -0.136 | 0.192 | -0.045 | -0.068 | -0.031 | 0.001 | -0.219 | -0.247 | -0.010 | 0.141 | 0.002 | 0.087 | -0.014 | -0.065 | -0.010 | -0.258 | 0.192 | -0.136 | 0.073 | 0.267 | 1.000 |
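The matrix above does not state which coefficient each cell uses, so the following is only an illustrative sketch and not the report's own computation. It implements Spearman's rank correlation in plain Python and applies it to the latitude and y_sp values of the first ten rows listed in this report. The helper names `ranks` and `spearman` are hypothetical. On this small sample the two columns are perfectly rank-aligned, which is consistent with the 1.000 reported for the latitude / y_sp pair.

```python
from statistics import mean

def ranks(values):
    """Return 1-based ranks; assumes no ties (true for the sample below)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result

def spearman(xs, ys):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    rx, ry = ranks(xs), ranks(ys)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5

# latitude and y_sp copied from the first ten rows shown in this report.
latitude = [40.724339, 40.756626, 40.679777, 40.622743, 40.596514,
            40.826887, 40.671347, 40.637774, 40.696668, 40.808967]
y_sp = [203232.9417, 214953.6472, 187008.2671, 166160.5847, 156667.5017,
        240538.5367, 183882.7143, 171634.7857, 193125.4947, 234027.3437]

rho = spearman(latitude, y_sp)
print(round(rho, 3))
```

A pair that correlates this strongly usually encodes the same information twice (here, two coordinate systems for the same position), so one of the two columns is typically enough for modelling.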
| | tree_id | block_id | created_at | tree_dbh | stump_diam | curb_loc | status | health | spc_latin | spc_common | steward | guards | sidewalk | user_type | problems | root_stone | root_grate | root_other | trunk_wire | trnk_light | trnk_other | brch_light | brch_shoe | brch_other | address | zipcode | zip_city | cb_num | borocode | boroname | cncldist | st_assem | st_senate | nta | nta_name | boro_ct | state | latitude | longitude | x_sp | y_sp |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 606945 | 305778 | 2016-06-28 | 10 | 0 | OnCurb | Alive | Good | Fraxinus pennsylvanica | green ash | None | None | NoDamage | TreesCount Staff | Stones | Yes | No | No | No | No | No | No | No | No | 76-046 164 STREET | 11366 | Fresh Meadows | 408 | 4 | Queens | 24 | 25 | 14 | QN37 | Kew Gardens Hills | 4125700 | New York | 40.724339 | -73.805180 | 1.038250e+06 | 203232.9417 |
| 1 | 160321 | 341273 | 2015-08-19 | 9 | 0 | OnCurb | Alive | Good | Gleditsia triacanthos var. inermis | honeylocust | None | None | NoDamage | Volunteer | BranchLights | No | No | No | No | No | No | Yes | No | No | 72-020 32 AVENUE | 11370 | East Elmhurst | 403 | 4 | Queens | 25 | 34 | 13 | QN28 | Jackson Heights | 4030902 | New York | 40.756626 | -73.894167 | 1.013571e+06 | 214953.6472 |
| 2 | 541347 | 325281 | 2015-12-30 | 7 | 0 | OnCurb | Alive | Good | Pyrus calleryana | Callery pear | None | None | NoDamage | TreesCount Staff | BranchLights | No | No | No | No | No | No | Yes | No | No | 153-026 119 AVENUE | 11434 | Jamaica | 412 | 4 | Queens | 28 | 32 | 10 | QN76 | Baisley Park | 4028800 | New York | 40.679777 | -73.788463 | 1.042923e+06 | 187008.2671 |
| 3 | 613930 | 203822 | 2016-07-05 | 10 | 0 | OnCurb | Alive | Good | Pyrus calleryana | Callery pear | None | None | NoDamage | TreesCount Staff | None | No | No | No | No | No | No | No | No | No | 89 89 STREET | 11209 | Brooklyn | 310 | 3 | Brooklyn | 43 | 46 | 22 | BK31 | Bay Ridge | 3005000 | New York | 40.622743 | -74.037543 | 9.738279e+05 | 166160.5847 |
| 4 | 18353 | 338911 | 2015-06-13 | 4 | 0 | OnCurb | Alive | Good | Prunus virginiana | 'Schubert' chokecherry | None | None | NoDamage | TreesCount Staff | BranchLights | No | No | No | No | No | No | Yes | No | No | 559 BEACH 68 STREET | 11692 | Arverne | 414 | 4 | Queens | 31 | 31 | 10 | QN12 | Hammels-Arverne-Edgemere | 4095400 | New York | 40.596514 | -73.797622 | 1.040452e+06 | 156667.5017 |
| 5 | 21173 | 108713 | 2015-06-15 | 8 | 0 | OnCurb | Alive | Good | Gleditsia triacanthos var. inermis | honeylocust | None | None | NoDamage | Volunteer | TrunkOtherBranchOther | No | No | No | No | No | Yes | No | No | Yes | 3554 BROADWAY | 10031 | New York | 109 | 1 | Manhattan | 7 | 71 | 31 | MN04 | Hamilton Heights | 1022900 | New York | 40.826887 | -73.949889 | 9.981185e+05 | 240538.5367 |
| 6 | 544698 | 201434 | 2016-01-20 | 2 | 0 | OnCurb | Alive | Fair | Quercus rubra | northern red oak | 1or2 | None | NoDamage | TreesCount Staff | None | No | No | No | No | No | No | No | No | No | 2030 PITKIN AVENUE | 11207 | Brooklyn | 305 | 3 | Brooklyn | 42 | 55 | 19 | BK85 | East New York (Pennsylvania Ave) | 3114400 | New York | 40.671347 | -73.897614 | 1.012652e+06 | 183882.7143 |
| 7 | 546240 | 228778 | 2016-02-06 | 2 | 0 | OnCurb | Alive | Good | Tilia americana | American linden | 1or2 | Helpful | NoDamage | Volunteer | None | No | No | No | No | No | No | No | No | No | 5008 FT HAMILTON PARKWAY | 11219 | Brooklyn | 312 | 3 | Brooklyn | 44 | 48 | 17 | BK88 | Borough Park | 3011400 | New York | 40.637774 | -73.998692 | 9.846129e+05 | 171634.7857 |
| 8 | 646348 | 309729 | 2016-07-29 | 4 | 0 | OnCurb | Alive | Good | Quercus palustris | pin oak | None | None | Damage | TreesCount Staff | None | No | No | No | No | No | No | No | No | No | 85-006 WOODHAVEN BOULEVARD | 11421 | Woodhaven | 409 | 4 | Queens | 32 | 38 | 15 | QN53 | Woodhaven | 4001400 | New York | 40.696668 | -73.853087 | 1.024988e+06 | 193125.4947 |
| 9 | 413812 | 501196 | 2015-11-02 | 5 | 0 | OnCurb | Alive | Good | Ulmus americana | American elm | None | None | Damage | TreesCount Staff | None | No | No | No | No | No | No | No | No | No | 1340 EAST BAY AVENUE | 10474 | Bronx | 202 | 2 | Bronx | 17 | 84 | 34 | BX27 | Hunts Point | 2009300 | New York | 40.808967 | -73.882647 | 1.016736e+06 | 234027.3437 |
| | tree_id | block_id | created_at | tree_dbh | stump_diam | curb_loc | status | health | spc_latin | spc_common | steward | guards | sidewalk | user_type | problems | root_stone | root_grate | root_other | trunk_wire | trnk_light | trnk_other | brch_light | brch_shoe | brch_other | address | zipcode | zip_city | cb_num | borocode | boroname | cncldist | st_assem | st_senate | nta | nta_name | boro_ct | state | latitude | longitude | x_sp | y_sp |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 683778 | 447656 | 502843 | 2015-11-11 | 1 | 0 | OnCurb | Alive | Good | Zelkova serrata | Japanese zelkova | None | None | NoDamage | NYC Parks Staff | None | No | No | No | No | No | No | No | No | No | 610 RIVER AVENUE | 10451 | Bronx | 204 | 2 | Bronx | 8 | 77 | 29 | BX63 | West Concourse | 2006300 | New York | 40.821540 | -73.929341 | 1.003807e+06 | 238594.3657 |
| 683779 | 572583 | 300942 | 2016-06-01 | 6 | 0 | OnCurb | Alive | Good | Cornus mas | Cornelian cherry | None | None | NoDamage | Volunteer | None | No | No | No | No | No | No | No | No | No | 28-048 46 STREET | 11103 | Astoria | 401 | 4 | Queens | 22 | 36 | 12 | QN70 | Astoria | 4014700 | New York | 40.762035 | -73.909937 | 1.009200e+06 | 216919.2837 |
| 683780 | 628527 | 317247 | 2016-07-15 | 11 | 0 | OnCurb | Alive | Good | Ginkgo biloba | ginkgo | None | None | NoDamage | TreesCount Staff | StonesTrunkOther | Yes | No | No | No | No | Yes | No | No | No | 114-016 150 AVENUE | 11420 | South Ozone Park | 410 | 4 | Queens | 32 | 31 | 15 | QN55 | South Ozone Park | 4084601 | New York | 40.667726 | -73.826959 | 1.032254e+06 | 182594.3356 |
| 683781 | 309254 | 410715 | 2015-10-11 | 7 | 0 | OnCurb | Alive | Good | Acer platanoides 'Crimson King' | crimson king maple | None | None | NoDamage | TreesCount Staff | Stones | Yes | No | No | No | No | No | No | No | No | 310 KINGHORN STREET | 10312 | Staten Island | 503 | 5 | Staten Island | 51 | 62 | 24 | SI01 | Annadale-Huguenot-Prince's Bay-Eltingville | 5017600 | New York | 40.528899 | -74.168105 | 9.375181e+05 | 132013.5133 |
| 683782 | 171664 | 103174 | 2015-08-24 | 12 | 0 | OnCurb | Alive | Good | Tilia americana | American linden | 1or2 | Helpful | NoDamage | Volunteer | None | No | No | No | No | No | No | No | No | No | 205 AVENUE C | 10009 | New York | 103 | 1 | Manhattan | 2 | 74 | 27 | MN28 | Lower East Side | 1002800 | New York | 40.727348 | -73.976573 | 9.907431e+05 | 204270.0951 |
| 683783 | 237788 | 223344 | 2015-09-19 | 2 | 0 | OnCurb | Alive | Poor | Prunus cerasifera | purple-leaf plum | 1or2 | None | NoDamage | TreesCount Staff | None | No | No | No | No | No | No | No | No | No | 1 BEARD STREET | 11231 | Brooklyn | 306 | 3 | Brooklyn | 38 | 51 | 25 | BK33 | Carroll Gardens-Columbia Street-Red Hook | 3005300 | New York | 40.672566 | -74.011473 | 9.810674e+05 | 184310.4162 |
| 683784 | 249489 | 335314 | 2015-09-23 | 2 | 0 | OnCurb | Dead | NaN | NaN | NaN | NaN | NaN | NaN | NYC Parks Staff | NaN | No | No | No | No | No | No | No | No | No | 87-015 LITTLE NECK PARKWAY | 11001 | Floral Park | 413 | 4 | Queens | 23 | 33 | 11 | QN44 | Glen Oaks-Floral Park-New Hyde Park | 4157903 | New York | 40.730434 | -73.710600 | 1.064458e+06 | 205525.7957 |
| 683785 | 230261 | 230303 | 2015-09-16 | 2 | 0 | OnCurb | Dead | NaN | NaN | NaN | NaN | NaN | NaN | TreesCount Staff | NaN | No | No | No | No | No | No | No | No | No | 644 EAST 8 STREET | 11230 | Brooklyn | 314 | 3 | Brooklyn | 40 | 44 | 17 | BK42 | Flatbush | 3048200 | New York | 40.633890 | -73.969779 | 9.926380e+05 | 170220.9185 |
| 683786 | 623784 | 318368 | 2016-07-12 | 18 | 0 | OnCurb | Alive | Good | Quercus rubra | northern red oak | None | None | Damage | NYC Parks Staff | None | No | No | No | No | No | No | No | No | No | 116-019 125 STREET | 11420 | South Ozone Park | 410 | 4 | Queens | 28 | 31 | 10 | QN55 | South Ozone Park | 4017800 | New York | 40.676190 | -73.813135 | 1.036082e+06 | 185685.7796 |
| 683787 | 139749 | 217836 | 2015-08-12 | 11 | 0 | OnCurb | Alive | Good | Zelkova serrata | Japanese zelkova | None | None | NoDamage | Volunteer | None | No | No | No | No | No | No | No | No | No | 209 SOUTH 2 STREET | 11211 | Brooklyn | 301 | 3 | Brooklyn | 34 | 53 | 18 | BK73 | North Side-South Side | 3052300 | New York | 40.712328 | -73.959385 | 9.955098e+05 | 198799.3400 |